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<221> misc_feature 

<222> Complement ( (883) .. (2608) ) 

<223> promoter region of Rubisco small subunit gene of 
Arabidopsis thaliana (PSSU) 

<220> 

<221> misc_feature 

<222> Complement { (2658) (3031) ) 

<223> 3' nos: 3' untranslated region containing the 

polyadenylation signal of the nopaline synthase 
gene of Agrobacterium T-DNA 

<220> 

<221> misc_feature 

<222> Complement ( (3032) .. (3367) ) 

<223> barnase: region coding for barnase 

<220> 

<221> misc_feature 

<222> Complement { (3368) .. (4876) ) 

<223> PTA29: promoter region of TA29 gene of Nicotiana 
tabacum 



<220> 

<221> misc_feature 

<222> Complement ( (4922) .. (4946) ) 

<223> LB: T-DNA left border 



<400> 1 

aattacaacg gtatatatcc tgccagtact 
gtcgataaga aaaggcaatt tgtagatgtt 
atatttattg ataaaataac aagtcaggta 
gatgcaagtt taaattcaga aatatttcaa 
tagatgaaag actgagtgcg atattatgtg 
agctcatcgg gggatcctag acgcgtgaga 
gggcggtacc ggcaggctga agtccagctg 
cttgaagccg gccgcccgca gcatgccgcg 
cacgctcggg tcgttgggca gcccgatgac 
cagggacttc agcaggtggg tgtagagcgt 
ggagacgtac acggtcgact cggccgtcca 
cgcgtaggcg atgccggcga cctcgccgtc 
ccgcagacgg acgaggtcgt ccgtccactc 
gaccgtgctt gtctcgatgt agtggttgac 
ggtggcacgg cggatgtcgg ccgggcgtcg 
gtgtgactga ggtttggtct agtgctttgg 
aacaagcttt ggagtgatcg gagggtctag 
acaccgttgg attttgagtg tggatatgtg 



cggccgtcga actcggccgt cgagtacatg 60 
aattcccatc ttgaaagaaa tatagtttaa 120 
ttatagtcca agcaaaaaca taaatttatt 180 
taactgatta tatcagctgg tacattgccg 240 
taatacataa attgatgata tagctagctt 3 00 
tcagatctcg gtgacgggca ggaccggacg 3 60 
ccagaaaccc acgtcatgcc agttcccgtg 420 
gggggcatat ccgagcgcct cgtgcatgcg 480 
agcgaccacg ctcttgaagc cctgtgcctc 540 
ggagcccagt cccgtccgct ggtggcgggg 600 
gtcgtaggcg ttgcgtgcct tccaggggcc 660 
cacctcggcg acgagccagg gatagcgctc 720 
ctgcggttcc tgcggctcgg tacggaagtt 780 
gatggtgcag accgccggca tgtccgcctc 840 
ttctgggtcc attgttcttc tttactcttt 900 
tcatctatat ataatgataa caacaatgag 960 
gatacatgag attcaagtgg actaggatct 1020 
tgaggttaat tttacttggt aacggccaca 1080 




aaggcctaag gagaggtgtt gagaccctta tcggcttgaa ccgctggaat aatgccacgt 1140 
ggaagataat tccatgaatc ttatcgttat ctatgagtga aattgtgtga tggtggagtg 1200 
gtgcttgctc attttacttg cctggtggac ttggcccttt ccttatgggg aatttatatt 1260 
ttacttacta tagagctttc ataccttttt tttaccttgg atttagttaa tatataatgg 1320 
tatgattcat gaataaaaat gggaaatttt tgaatttgta ctgctaaatg cataagatta 1380 
ggtgaaactg tggaatatat atttttttca tttaaaagca aaatttgcct tttactagaa 1440 
ttataaatat agaaaaatat ataacattca aataaaaatg aaaataagaa ctttcaaaaa 1500 
acagaactat gtttaatgtg taaagattag tcgcacatca agtcatctgt tacaatatgt 1560 
tacaacaagt cataagccca acaaagttag cacgtctaaa taaactaaag agtccacgaa 162 0 
aatattacaa atcataagcc caacaaagtt attgatcaaa aaaaaaaaac gcccaacaaa 1680 
gctaaacaaa gtccaaaaaa aacttctcaa gtctccatct tcctttatga acattgaaaa 1740 
ctatacacaa aacaagtcag ataaatctct ttctgggcct gtcttcccaa cctcctacat 1800 
cacttcccta tcggattgaa tgttttactt gtaccttttc cgttgcaatg atattgatag 1860 
tatgtttgtg aaaactaata gggttaacaa tcgaagtcat ggaatatgga tttggtccaa 1920 
gattttccga gagctttcta gtagaaagcc catcaccaga aatttactag taaaataaat 1980 
caccaattag gtttcttatt atgtgccaaa ttcaatataa ttatagagga tatttcaaat 2040 
gaaaacgtat gaatgttatt agtaaatggt caggtaagac attaaaaaaa tcctacgtca 2100 
gatattcaac tttaaaaatt cgatcagtgt ggaattgtac aaaaatttgg gatctactat 2160 
atatatataa tgctttacaa cacttggatt tttttttgga ggctggaatt tttaatctac 2220 
atatttgttt tggccatgca ccaactcatt gtttagtgta atactttgat tttgtcaaat 2280 
atatgtgttc gtgtatattt gtataagaat ttctttgacc atatacacac acacatatat 2340 
atatatatat atatattata tatcatgcac ttttaattga aaaaataata tatatatata 2400 
tagtgcattt tttctaacaa ccatatatgt tgcgattgat ctgcaaaaat actgctagag 2460 
taatgaaaaa tataatctat tgctgaaatt atctcagatg ttaagatttt cttaaagtaa 2520 
attctttcaa attttagcta aaagtcttgt aataactaaa gaataataca caatctcgac 2580 
cacggaaaaa aaacacataa taaatttgaa tttcgaccgc ggtacccgga attcgagctc 2640 
ggtacccggg gatcttcccg atctagtaac atagatgaca ccgcgcgcga taatttatcc 2700 
tagtttgcgc gctatatttt gttttctatc gcgtattaaa tgtataattg cgggactcta 2760 
atcataaaaa cccatctcat aaataacgtc atgcattaca tgttaattat tacatgctta 2820 
acgtaattca acagaaatta tatgataatc atcgcaagac cggcaacagg attcaatctt 2880 
aagaaacttt attgccaaat gtttgaacga tctgcttcgg atcctctaga gccggaaagt 2940 
gaaattgacc gatcagagtt tgaagaaaaa tttattacac actttatgta aagctgaaaa 3000 
aaacggcctc cgcaggaagc cgtttttttc gttatctgat ttttgtaaag gtctgataat 3060 
ggtccgttgt tttgtaaatc agccagtcgc ttgagtaaag aatccggtct gaatttctga 3120 
agcctgatgt atagttaata tccgcttcac gccatgttcg tccgcttttg cccgggagtt 3180 
tgccttccct gtttgagaag atgtctccgc cgatgctttt ccccggagcg acgtctgcaa 3240 
ggttcccttt tgatgccacc cagccgaggg cttgtgcttc tgattttgta atgtaattat 3300 
caggtagctt atgatatgtc tgaagataat ccgcaacccc gtcaaacgtg ttgataaccg 3360 
gtaccatggt agctaatttc tttaagtaaa aactttgatt tgagtgatga tgttgtactg 3420 
ttacacttgc accacaaggg catatataga gcacaagaca tacacaacaa cttgcaaaac 3480 
taacttttgt tggagcattt cgaggaaaat ggggagtagc aggctaatct gagggtaaca 3540 
ttaaggtttc atgtattaat ttgttgcaaa catggactta gtgtgaggaa aaagtaccaa 3600 
aattttgtct caccctgatt tcagttatgg aaattacatt atgaagctgt gctagagaag 3660 
atgtttattc tagtccagcc acccacctta tgcaagtctg cttttagctt gattcaaaaa 3720 
ctgatttaat ttacattgct aaatgtgcat acttcgagcc tatgtcgctt taattcgagt 3780 
aggatgtata tattagtaca taaaaaatca tgtttgaatc atctttcata aagtgacaag 3840 
tcaattgtcc cttcttgttt ggcactatat tcaatctgtt aatgcaaatt atccagttat 3900 
acttagctag atatccaatt ttgaataaaa atagctcttg attagtaaac cggatagtga 3960 
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caaagtcaca tatccatcaa acttctggtg 
taaaatttaa attgggacac ataaatagcc 
cagattgtta catggaaaac aaaaagtcct 
ctatcgagag atagattgaa agaagtgcag 
tctaaattaa ttgcattcgc taaccaaaaa 
tattttttgg cccttttttt atggtccaaa 
tttaattatt tttttactac agtgcccttg 
atgtttatgt gaagaaatag taaaggttaa 
aaatgtgaat ttcttaatct gtgtgaaaac 
aaagtatata aatatatatt tggaagcgac 
cctaaaaaca gcatatggta gtttctaggg 
caacaagtat caatacatat gatttacacc 
taataaagaa ttaatccaaa tagcctccca 
aatgtatatt atatgcataa tttatatatt 
taatctatgt atatggttag aaaaagtaaa 
atccctaata taatcgcgac ggatccccgg 
catttacaat tgaatatatc ctgccg 

<210> 2 
<211> 6548 
<212> DNA 

<213> Artificial Sequence 



ctcgtggcta agttctgatc gacatggggt 4020 
tatttgtgca aatctcccca tcgaaaatga 4080 
ctgatagaag tcgcaaagta tcacaatttt 4140 
ggaagcggtt aactggaaca taacacaatg 4200 
gtgtattact ctctccggtc cacaataagt 42 60 
ataagtgagt tttttagatt tcaaaaatga 4320 
gagtaaatgg tgttggagta tgtgttagaa 4380 
tatgatcaat ttcattgcta tttaatgtta 4440 
aaccaaaaaa tcacttattg tggaccggag 4500 
taaaaataaa cttttctcat attatacgaa 4560 
aatctaaatc actaaaatta ataaaagaag 4620 
gtcaaacacg aaattcgtaa atatttaata 4680 
ccctataact taaactaaaa ataaccagcg 4740 
aaatgtgtat aatcatgtat aatcaatgta 4800 
caattaatat agccggctat ttgtgtaaaa 4860 
gaattccggg gaagcttaga tccatggagc 4920 

4946 



<220> 

<223> Description of Artificial Sequence: plasmid PTS172 
<220> 

<221> misc_feature 

<222> Complement ( (2019) (2288) ) 

<223> 3' nos: 3' untranslated region containing the 

polyadenylation signal of the nopaline synthase 
gene of Agrobacterium T-DNA. 

<220> 

<221> misc_feature 

<222> Complement ( (2289) (2624) ) 

<223> barnase: region coding for barnase 

<220> 

<221> misc_feature 

<222> Complement ( (2625) (4313) ) 

<223> PEl : promoter region of El gene of rice 

<220> 

<221> misc_feature 
<222> (4336) . . (5170) 

<223> P35S: 35S promoter region of Cauliflower Mosaic 
Virus 
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<220> 

<221> misc_feature 
<222> Complement ( (5711) (6262) ) 
<223> bar: region coding for phosphinotricin 
ace tyl trans f erase 

<220> 

<221> misc_feature 
<222> (6263) . . (6496) 

<223> 3'g7: 3' untranslated region containing the 

polyadenylation signal of gene 7 of Agrobacterium 
T-DNA 

<400> 2 

aattcaagct tgacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt 60 
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 120 
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 180 
cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 240 
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 3 00 
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 360 
gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc 420 
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 480 
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 540 
gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 600 
aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 660 
ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 720 
ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 780 
gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 840 
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 900 
aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 960 
aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 1020 
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 1080 
gtgaagatcc tttttggctc gagtctcatg accaaaatcc cttaacgtga gttttcgttc 1140 
cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg 12 00 
cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg 1260 
gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca 1320 
aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg 1380 
cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg 1440 
tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga 1500 
acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac 1560 
ctacagcgtg agcattgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat 1620 
ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc 1680 
tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga 1740 
tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 1800 
ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 1860 
gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 1920 
cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc 1980 
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gcgcgttggc ctgatcagaa ttcatatgca cgtgttcccg atctagtaac atagatgaca 2040 
ccgcgcgcga taatttatcc tagtttgcgc gctatatttt gttttctatc gcgtattaaa 2100 
tgtataattg cgggactcta atcataaaaa cccatctcat aaataacgtc atgcattaca 2160 
tgttaattat tacatgctta acgtaattca acagaaatta tatgataatc atcgcaagac 2220 
cggcaacagg attcaatctt aagaaacttt attgccaaat gtttgaacga tctgcttcgg 2280 
aggttacctt atctgatttt tgtaaaggtc tgataatggt ccgttgtttt gtaaatcagc 2340 
cagtcgcttg agtaaagaat ccggtctgaa tttctgaagc ctgatgtata gttaatatcc 2400 
gcttcacgcc atgttcgtcc gcttttgccc gggagtttgc cttccctgtt tgagaagatg 2460 
tctccgccga tgcttttccc cggagcgacg tctgcaaggt tcccttttga tgccacccag 2520 
ccgagggctt gtgcttctga ttttgtaatg taattatcag gtagcttatg atatgtctga 2580 
agataatccg caaccccgtc aaacgtgttg ataaccggta ccatcgcgac ggcttgatgg 2640 
atctcttgct ggacaccggg atgctaggat gggttatcgt ggccggcgtg cgtgtgtggc 2700 
ttttgtaggc gccggcgacg gcgggggcaa tgtggcaggt gagtcacggt gcaagcgtgc 2760 
gcaagtgact gcaacaacca aggacggtca tggcgaaagc acctcacgcg tccaccgtct 2820 
acaggatgta gcagtagcac ggtgaaagaa gtgttgtccc gtccattagg tgcattctca 2880 
ccgttggcca gaacaggacc gttcaacagt taggttgagt gtaggacttt tacgtggtta 2940 
atgtatggca aatagtagta aattttgccc ccattggtct ggctgagata gaacatattc 3000 
tggaaagcct ctagcatatc ttttttgaca gctaaacttt gcttcttgcc ttcttggtct 3060 
agcaatgacg ttgcccatgt cgtggcaaac atctggtaag gtaactgtat tcgtttgttc 3120 
ccttcaacgg ctcaatcccc acaggccaag ctatcctttc cttggcagta taggctcctt 3180 
gagagattat actaccattt ttaagtgctt ataaagacga tgctctctaa ccagatcgat 3240 
cagaaacaca aagttttagc agcgtaatat cccacacaca tacacacacg aagctatgcc 3300 
tcctcatttt ccgagagatt ctgacagtga ccagaatgtc agaatgccat ttcatgggca 3360 
caagtcgatc cacaagcttc ttggtggagg tcaaggtgtg ctattattat tcgctttcta 3420 
ggaaattatt cagaattagt gccttttatc ataacttctc tctgagccga tgtggttttg 3480 
gatttcattg ttgggagcta tgcagttgcg gatattctgc tgtggaagaa caggaactta 3540 
tctgcggggg tccttgctgg ggcaacattg atatggttcc tgttcgatgt agtagaatac 3600 
aatataattc cgctcctttg ccagattgcc attcttgcca tgcttgtgat cttcatttgg 3660 
tcaaatgccg caccactctt ggacaggtat tagctttatt tcctgtggag atggtagaaa 372 0 
actcagctta cagaaatggc atttcacgta gtataacgca agacattagg tactaaaact 3780 
caactaactg tttccgaatt tcagggcccc tccaaggatc ccagaaatca tcatctctga 3840 
acatgccttc agagaaatgg cattgaccgt ccattacaaa ctaacgtaca ctgtatctgt 3900 
tctttacgac attgcatgtg gaaaggatct gaagagattt ctcctggtac ataataatct 3960 
actcctttgc tacgttaata agagatgtaa aaacatgcaa cagttccagt gccaacattg 4020 
tccaaggatt gtgcaattct ttctggagcg ctaaaattga ccagattaga cgcatcagaa 4080 
tattgaattg cagagttagc caataatcct cataatgtta atgtgctatt gttgttcact 4140 
actcaatata gttctggact aacaatcaga ttgtttatga tattaaggtg gttggatctc 4200 
tattggtatt gtcggcgatt ggaagttctt gcagcttgac aagtctacta tatattggta 42 60 
ggtattccag ataaatatta aattttaata aaacaatcac acagaaggat ctgcggccgc 4320 
tagcctaggc ccgggcccac aaaaatctga gcttaacagc acagttgctc ctctcagagc 4380 
agaatcgggt attcaacacc ctcatatcaa ctactacgtt gtgtataacg gtccacatgc 4440 
cggtatatac gatgactggg gttgtacaaa ggcggcaaca aacggcgttc ccggagttgc 4500 
acacaagaaa tttgccacta ttacagaggc aagagcagca gctgacgcgt acacaacaag 4560 
tcagcaaaca gacaggttga acttcatccc caaaggagaa gctcaactca agcccaagag 4620 
ctttgctaag gccctaacaa gcccaccaaa gcaaaaagcc cactggctca cgctaggaac 4680 
caaaaggccc agcagtgatc cagccccaaa agagatctcc tttgccccgg agattacaat 4740 
ggacgatttc ctctatcttt acgatctagg aaggaagttc gaaggtgaag gtgacgacac 4800 
tatgttcacc actgataatg agaaggttag cctcttcaat ttcagaaaga atgctgaccc 4860 
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acagatggtt agagaggcct acgcagcagg tctcatcaag acgatctacc cgagtaacaa 4920 
tctccaggag atcaaatacc ttcccaagaa ggttaaagat gcagtcaaaa gattcaggac 4980 
taattgcatc aagaacacag agaaagacat atttctcaag atcagaagta ctattccagt 5040 
atggacgatt caaggcttgc ttcataaacc aaggcaagta atagagattg gagtctctaa 5100 
aaaggtagtt cctactgaat ctaaggccat gcatggagtc taagattcaa atcgaggatc 5160 
taacagaact cgccgtgaag actggcgaac agttcataca gagtctttta cgactcaatg 5220 
acaagaagaa aatcttcgtc aacatggtgg agcacgacac tctggtctac tccaaaaatg 5280 
tcaaagatac agtctcagaa gaccaaaggg ctattgagac ttttcaacaa aggataattt 5340 
cgggaaacct cctcggattc cattgcccag ctatctgtca cttcatcgaa aggacagtag 5400 
aaaaggaagg tggctcctac aaatgccatc attgcgataa aggaaaggct atcattcaag 5460 
atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc atcgtggaaa 5520 
aagaagacgt tccaaccacg tcttcaaagc aagtggattg atgtgacatc tccactgacg 5580 
taagggatga cgcacaatcc cactatcctt cgcaagaccc ttcctctata taaggaagtt 5640 
catttcattt ggagaggaca cgctgaaatc accagtctct ctctataaat ctatctctct 5700 
ctctataacc atggacccag aacgacgccc ggccgacatc cgccgtgcca ccgaggcgga 5760 
catgccggcg gtctgcacca tcgtcaacca ctacatcgag acaagcacgg tcaacttccg 5820 
taccgagccg caggaaccgc aggagtggac ggacgacctc gtccgtctgc gggagcgcta 5880 
tccctggctc gtcgccgagg tggacggcga ggtcgccggc atcgcctacg cgggcccctg 5940 
gaaggcacgc aacgcctacg actggacggc cgagtcgacc gtgtacgtct ccccccgcca 6000 
ccagcggacg ggactgggct ccacgctcta cacccacctg ctgaagtccc tggaggcaca 6060 
gggcttcaag agcgtggtcg ctgtcatcgg gctgcccaac gacccgagcg tgcgcatgca 6120 
cgaggcgctc ggatatgccc cccgcggcat gctgcgggcg gccggcttca agcacgggaa 6180 
ctggcatgac gtgggtttct ggcagctgga cttcagcctg ccggtaccgc cccgtccggt 6240 
cctgcccgtc accgagatct gagatcacgc gttctaggat cccccgatga gctaagctag 63 00 
ctatatcatc aatttatgta ttacacataa tatcgcactc agtctttcat ctacggcaat 6360 
gtaccagctg atataatcag ttattgaaat atttctgaat ttaaacttgc atcaataaat 6420 
ttatgttttt gcttggacta taatacctga cttgttattt tatcaataaa tatttaaact 6480 
atatttcttt caagatggga attaacatct acaaattgcc ttttcttatc gaccatgtac 6540 
^^^^^^^^ 6548 

<210> 3 
<211> 1601 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: T72 promoter 
region 



<220> 

<221> inisc_f eature 
<222> (1) . . (1601) 

<223> promoter region of T72 gene of rice 



<400> 3 

cgccgtgagt gtcttctgcc gccgaggggc 
gtgcgtgtgt gtcgtggtgg tggtggcgat 
atcgaaggag gggagcgcgc gcggcgaggc 



tctcgctcgt cgtcgatgcc tgcacggtgc 60 
acgcgacgcg agctcgattt ataggagggg 120 
ccgcgttgct cacctacgcc gcgcgcatgc 180 
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ggcggacgcg cggtcggcgc ccgcgccggc cgggaggacg agggcgcaag cgtgtgagcc 240 
accgaacgcg cgcgcgcgcc gcggcgcgaa ctctccatcg cgtcgcggcg agccgagagc 300 
cgacgagagc gtttcgcgcg cgcggttggg ccggcgacaa gatgggccgt agccctgggc 360 
ctcgtgccat cttttttttt cttttttgcc ttttttggcc tggcaatttc tttttgtttt 420 
tagtcttttt gtggtgataa tgtgtcgtct tccggtgaac taatttactc gttgatcttt 480 
ttgtgtccct tcgaatattc gcagtggtag aagatgacta ctactaccag tagttgatct 540 
cgaatggcaa cttttgtgca gaacttattc cacggctatg tcagcttcca ctgtgactaa 600 
aaaaactacg gccatctttt ggacttgttc tatcttggaa ctgaacaaaa aggacgatcc 660 
tgatgtacac acggcatagt ttccagcact ggatgccaag ttgccaactg ttaccacgat 720 
aatggaacga cgagatgaga tattatacaa gtccaatgga tcaagatcct gtgcagttgt 780 
tattgtaact gtaacttaag ccgttaacat gtacatcaca tttcctactc tatcaatgtc 840 
ttgtgcgggt tgtttcaaaa aaacatgtac atcacatgat ctagaacgga aggccaggat 900 
atgaagtggt actgcagcaa aaacactgta gcagagatgt actattatgc atgtactgta 960 
gcagtcatct agagccgttg gatctgaaaa cgaatggaca tgattgtgtg cagttgctat 1020 
tgtgcagtta caatagcaac tgcatttgat cttaatccaa gtccaataca tgcagaacag 1080 
tagctacgag ctggaaagga tgcaaatctg ggtgacactg acagcaaccg tggaagaaca 1140 
acagcagcaa agtcccagag ggatggcaat ttgaaggaat ttaaatactc taatattact 1200 
ccacccgtta aaaaaaacaa cttgctacgc ataatatatg ttcggattta tagcgagaag 1260 
ttaatttttc atgagaagaa gaatatatat gtaatatgta ctaggagagt actcgcttca 1320 
taaatataaa tattcataag ttgtccagtg aagatagctt tagaaaaaac tagttatttt 1380 
atttgtcaaa ttttaaattt tgaagtagtt agattatctt tctagtagtt ctgattggtt 1440 
gaaaatgttt agattttcat gtgttaagag ttccgtatcc taaaaatagt aatataattt 1500 
taaatcatat atatatatat atatatatat atatatatat atatatatat atatatatat 1560 
tgttgaacgg tttgtgctct ggttgctatc ctgttctgtg g 1601 

<210> 4 
<211> 6291 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pVE136 
<220> 

<221> misc_feature 

<222> Complement ( (425) (687) ) 

<223> 3' nos: 3' untranslated region containing the 

polyadenylation signal of the nopaline synthase 
gene of Agrobacterium T-DNA 



<220> 

<221> misc_feature 

<222> Complement ( (803) (1138) ) 

<223> barnase: region coding for barnase 

<220> 

<221> misc^feature 

<222> Complement ( (1138) .. (2317) ) 
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i;3 



i: ^ 




<223> PCa55: s t amen- spec i fic promoter from corn gene 
' CA55 

<220> 

<221> misc_feature 
<222> (2355) . . (3187) 

<223> p35S: 35S promoter region of Cauliflower mosaic 
virus 

<220> 

<221> misc_feature 
<222> (3188) , . (3739) 

<223> bar: region coding for phosphinoacetyl transferase 
<220> 

<221> misc_feature 
<222> (3757) . . (4017) 

<223> 3' nos: 3' untranslated region containing the 

polyadenylation of the nopaline synthase gene of 
Agrobacterium T-DNA 

<220> 

<221> misc_feature 
<222> (699) . . (702) 

<223> region with unknown sequence (may contain up to 15 
nucleotides) 



<400> 4 

tcgcgcgttt cggtgatgac ggtgaaaacc 
cagcttgtct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cttaactatg 
accatatgcg gtgtgaaata ccgcacagat 
attcgccatt caggctgcgc aactgttggg 
tacgccagct ggcgaaaggg ggatgtgctg 
tttcccagtc acgacgttgt aaaacgacgg 
cttcccgatc tagtaacata gatgacaccg 
atattttgtt ttctatcgcg tattaaatgt 
atctcataaa taacgtcatg cattacatgt 
gaaattatat gataatcatc gcaagaccgg 
gccaaatgtt tgaacgatct gcttcggatc 
cgatcagagt ttgaagaaaa atttattaca 
ccgcaggaag ccgttttttt cgttatctga 
ttttgtaaat cagccagtcg cttgagtaaa 
tatagttaat atccgcttca cgccatgttc 
tgtttgagaa gatgtctccg ccgatgcttt 
ttgatgccac ccagccgagg gcttgtgctt 
tatgatatgt ctgaagataa tccgcaaccc 
ctgcagctag ttagctcgat gtatcttctg 



tctgacacat gcagctcccg gagacggtca 60 
gacaagcccg tcagggcgcg tcagcgggtg 12 0 
cggcatcaga gcagattgta ctgagagtgc 180 
gcgtaaggag aaaataccgc atcaggcgcc 240 
aagggcgatc ggtgcgggcc tcttcgctat 3 00 
caaggcgatt aagttgggta acgccagggt 3 60 
ccagtgaatt cgagctcggt acccggggat 420 
cgcgcgataa tttatcctag tttgcgcgct 480 
ataattgcgg gactctaatc ataaaaaccc 540 
taattattac atgcttaacg taattcaaca 600 
caacaggatt caatcttaag aaactttatt 660 
ctctagagnn nnccggaaag tgaaattgac 720 
cactttatgt aaagctgaaa aaaacggcct 780 
tttttgtaaa ggtctgataa tggtccgttg 840 
gaatccggtc tgaatttctg aagcctgatg 900 
gtccgctttt gcccgggagt ttgccttccc 960 
tccccggagc gacgtctgca aggttccctt 1020 
ctgattttgt aatgtaatta tcaggtagct 1080 
cgtcaaacgt gttgataacc ggtaccatgg 1140 
tatatgcagt gcagcttctg cgttttggct 1200 
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gctttgagct gtgaaatctc gctttccagt ccctgcgtgt tttatagtgc tgtacgttcg 1260 
tgatcgtgag caaacagggc gtgcctcaac tactggtttg gttgggtgac aggcgccaac 132 0 
tacgtgctcg taaccgatcg agtgagcgta atgcaacatt ttttcttctt ctctcgcatt 1380 
ggtttcatcc agccaggaga cccgaatcga attgaaatca caaatctgag gtacagtatt 1440 
tttacagtac cgttcgttcg aaggtcttcg acaggtcaag gtaacaaaat cagttttaaa 1500 
ttgttgtttc agatcaaaga aaattgagat gatctgaagg acttggacct tcgtccaatg 1560 
aaacacttgg actaattaga ggtgaattga aagcaagcag atgcaaccga aggtggtgaa 1620 
agtggagttt cagcattgac gacgaaaacc ttcgaacggt ataaaaaaga agccgcaatt 1680 
aaacgaagat ttgccaaaaa gatgcatcaa ccaagggaag acgtgcatac atgtttgatg 1740 
aaaactcgta aaaactgaag tacgattccc cattcccctc cttttctcgt ttcttttaac 1800 
tgaagcaaag aatttgtatg tattccctcc attccatatt ctaggaggtt ttggcttttc 1860 
ataccctcct ccatttcaaa ttatttgtca tacattgaag atatacacca ttctaattta 1920 
tactaaatta cagcttttag atacatatat tttattatac acttagatac gtattatata 1980 
aaacacctaa tttaaaataa aaaattatat aaaaagtgta tctaaaaaat caaaatacga 2040 
cataatttga aacggagggg tactacttat gcaaaccaat cgtggtaacc ctaaacccta 2100 
tatgaatgag gccatgattg taatgcaccg tctgattaac caagatatca atggtcaaag 2160 
atatacatga tacatccaag tcacagcgaa ggcaaatgtg acaacagttt tttttaccag 2220 
agggacaagg gagaatatct attcagatgt caagttcccg tatcacactg ccaggtcctt 2280 
actccagacc atcttccggc tctattgatg cataccagga attgatctag agtcgacctg 2340 
caggcatgca agctcctacg cagcaggtct catcaagacg atctacccga gtaacaatct 2400 
ccaggagatc aaataccttc ccaagaaggt taaagatgca gtcaaaagat tcaggactaa 2460 
ttgcatcaag aacacagaga aagacatatt tctcaagatc agaagtacta ttccagtatg 2520 
gacgattcaa ggcttgcttc ataaaccaag gcaagtaata gagattggag tctctaaaaa 2580 
ggtagttcct actgaatcta aggccatgca tggagtctaa gattcaaatc gaggatctaa 2640 
cagaactcgc cgtgaagact ggcgaacagt tcatacagag tcttttacga ctcaatgaca 2700 
agaagaaaat cttcgtcaac atggtggagc acgacactct ggtctactcc aaaaatgtca 2760 
aagatacagt ctcagaagac caaagggcta ttgagacttt tcaacaaagg ataatttcgg 2820 
gaaacctcct cggattccat tgcccagcta tctgtcactt catcgaaagg acagtagaaa 2880 
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggctatc attcaagatg 2940 
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 3000 
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgacatctcc actgacgtaa 3060 
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 3120 
ttcatttgga gaggacacgc tgaaatcacc agtctctctc tataaatcta tctctctctc 3180 
tataaccatg gacccagaac gacgcccggc cgacatccgc cgtgccaccg aggcggacat 3240 
gccggcggtc tgcaccatcg tcaaccacta catcgagaca agcacggtca acttccgtac 3300 
cgagccgcag gaaccgcagg agtggacgga cgacctcgtc cgtctgcggg agcgctatcc 3360 
ctggctcgtc gccgaggtgg acggcgaggt cgccggcatc gcctacgcgg gcccctggaa 3420 
ggcacgcaac gcctacgact ggacggccga gtcgaccgtg tacgtctccc cccgccacca 3480 
gcggacggga ctgggctcca cgctctacac ccacctgctg aagtccctgg aggcacaggg 3540 
cttcaagagc gtggtcgctg tcatcgggct gcccaacgac ccgagcgtgc gcatgcacga 3600 
ggcgctcgga tatgcccccc gcggcatgct gcgggcggcc ggcttcaagc acgggaactg 3660 
gcatgacgtg ggtttctggc agctggactt cagcctgccg gtaccgcccc gtccggtcct 3720 
gcccgtcacc gagatctgat ctcacgcgtc taggatccga agcagatcgt tcaaacattt 3780 
ggcaataaag tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt atcatataat 3840 
ttctgttgaa ttacgttaag catgtaataa ttaacatgta atgcatgacg ttatttatga 3900 
gatgggtttt tatgattaga gtcccgcaat tatacattta atacgcgata gaaaacaaaa 3960 
tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc atctatgtta ctagatcggg 4020 
aagatcctct agagtcgacc tgcaggcatg caagcttggc gtaatcatgg tcatagctgt 4080 
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ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 4140 
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 4200 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 4260 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 4320 
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 4380 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 4440 
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 4500 
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 4560 
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 4620 
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 4680 
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 4740 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 4800 
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 4860 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat 4920 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 4980 
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 5040 
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 5100 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 5160 
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 5220 
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 5280 
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 5340 
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 5400 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 5460 
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 5520 
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 5580 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 5640 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 5700 
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 5760 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 5820 
gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 5880 
taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 5940 
tgttgigatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 6000 
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 6060 
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 612 0 
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 6180 
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa gaaaccatta 6240 
ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt c 62 91 

<210> 5 
<211> 5560 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: T-DNA of 
plasmid pTHW142 



<220> 
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<221> misc_f eature 
<222> (1) . . (25) 

<223> RB: right border sequence of octopine TL-DNA from 
pTiB6S3 

<220> 

<221> misc_f eature 
<222> (84) . . (296) 

<223> 3' g7: 3' untranslated region containing the 

polyadenylation signal of gene. 7 of Agrobacterium 
T-DNA 

<220> 

<221> misc_f eature 
<222> (318) . . (869) 

<223> bar: region coding for phosphinotricin 
acetyl transferase 

<220> 

<221> misc_f eature 
<222> (830) . . (2760) 

<223> pSSU: promoter region of Rubisco small subunit 
gene of Arabidopsis thaliana 

<220> 

<221> misc_feature 
<222> (2765) . . (3058) 

<223> 3' untranslated region of the CaMV 35S transcript 
containing polyadenylation signals 

<220> 

<221> misc_feature 
<222> (3059) . . (5056) 

<223> uidA: region coding for beta-glucuronidase 
<220> 

<221> misc_feature 
<222> (4483) . . (4671) 

:223> IV2: region corresponding to the second intron of 
the ST-LSl gene 

220> 

221> misc_feature 
222> (5067) . . (5502) 

223> P35S: 35S promoter region of CaMV 
220> 

221> misc_feature 
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<222> (5533) , . (5560) 

<223> LB: left border sequence of octopine TL-DNA from 
PTIB6S3 

<220> 

<221> inisc_feature 
<222> (5058) . . (5059) 

<223> region with unknovm sequence (may contain up to 20 
nucleotides) 

<220> 

<221> misc_feature 
<222> (5077) . . (5078) 

<223> region with unknown sequence (may contain up to 20 
nucleotides) 

<220> 

<221> misc_feature 
<222> (5476) . . (5479) 

<223> region with unknown sequence (may contain up to 20 
nucleotides) 

<400> 5 

aattacaacg gtatatatcc tgccagtact cggccgtcga gtacatggtc gataagaaaa 60 
ggcaatttgt agatgttaat tcccatcttg aaagaaatat agtttaaata tttattgata 120 
aaataacaag tcaggtatta tagtccaagc aaaaacataa atttattgat gcaagtttaa 180 
attcagaaat atttcaataa ctgattatat cagctggtac attgccgtag atgaaagact 240 
gagtgcgata ttatgtgtaa tacataaatt gatgatatag ctagcttagc tcatcggggg 300 
atcctagacg cgtgagatca gatctcggtg acgggcagga ccggacgggg cggtaccggc 360 
aggctgaagt ccagctgcca gaaacccacg tcatgccagt tcccgtgctt gaagccggcc 420 
gcccgcagca tgccgcgggg ggcatatccg agcgcctcgt gcatgcgcac gctcgggtcg 480 
ttgggcagcc cgatgacagc gaccacgctc ttgaagccct gtgcctccag ggacttcagc 540 
aggtgggtgt agagcgtgga gcccagtccc gtccgctggt ggcgggggga gacgtacacg 600 
gtcgactcgg ccgtccagtc gtaggcgttg cgtgccttcc aggggcccgc gtaggcgatg 660 
ccggcgacct cgccgtccac ctcggcgacg agccagggat agcgctcccg cagacggacg 720 
aggtcgtccg tccactcctg cggttcctgc ggctcggtac ggaagttgac cgtgcttgtc 780 
tcgatgtagt ggttgacgat ggtgcagacc gccggcatgt ccgcctcggt ggcacggcgg 840 
atgtcggccg ggcgtcgttc tgggtccatg cagttaactc ttccgccgtt gcttgtgatg 900 
gaagtaatgt cgttgttagc cttgcgggtg gctgggaagg cagcggagga cttaagtccg 960 
ttgaaaggag cgaccatagt ggcctgagcc ggagaggcaa ccatagtagc ggaagagagc 1020 
atagaggaag ccattgttct tctttactct ttgtgtgact gaggtttggt ctagtgcttt 1080 
ggtcatctat atataatgat aacaacaatg agaacaagct ttggagtgat cggagggtct 1140 
aggatacatg agattcaagt ggactaggat ctacaccgtt ggattttgag tgtggatatg 12 00 
tgtgaggtta attttacttg gtaacggcca caaaggccta aggagaggtg ttgagaccct 12 60 
tatcggcttg aaccgctgga ataatgccac gtggaagata attccatgaa tcttatcgtt 1320 
atctatgagt gaaattgtgt gatggtggag tggtgcttgc tcattttact tgcctggtgg 1380 
acttggccct ttccttatgg ggaatttata ttttacttac tatagagctt tcataccttt 1440 
tttttacctt ggatttagtt aatatataat ggtatgattc atgaataaaa atgggaaatt 1500 
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tttgaatttg tactgctaaa tgcataagat taggtgaaac tgtggaatat atattttttt 1560 
catttaaaag caaaatttgc cttttactag aattataaat atagaaaaat atataacatt 1620 
caaataaaaa tgaaaataag aactttcaaa aaacagaact atgtttaatg tgtaaagatt 1680 
agtcgcacat caagtcatct gttacaatat gttacaacaa gtcataagcc caacaaagtt 1740 
agcacgtcta aataaactaa agagtccacg aaaatattac aaatcataag cccaacaaag 1800 
ttattgatca aaaaaaaaaa acgcccaaca aagctaaaca aagtccaaaa aaaacttctc 1860 
aagtctccat cttcctttat gaacattgaa aactatacac aaaacaagtc agataaatct 1920 
ctttctgggc ctgtcttccc aacctcctac atcacttccc tatcggattg aatgttttac 1980 
ttgtaccttt tccgttgcaa tgatattgat agtatgtttg tgaaaactaa tagggttaac 2 040 
aatcgaagtc atggaatatg gatttggtcc aagattttcc gagagctttc tagtagaaag 2100 
cccatcacca gaaatttact agtaaaataa atcaccaatt aggtttctta ttatgtgcca 2160 
aattcaatat aattatagag gatatttcaa atgaaaacgt atgaatgtta ttagtaaatg 2220 
gtcaggtaag acattaaaaa aatcctacgt cagatattca actttaaaaa ttcgatcagt 2280 
gtggaattgt acaaaaattt gggatctact atatatatat aatgctttac aacacttgga 2340 
tttttttttg gaggctggaa tttttaatct acatatttgt tttggccatg caccaactca 2400 
ttgtttagtg taatactttg attttgtcaa atatatgtgt tcgtgtatat ttgtataaga 2460 
atttctttga ccatatacac acacacatat atatatatat atatatatta tatatcatgc 2520 
acttttaatt gaaaaaataa tatatatata tatagtgcat tttttctaac aaccatatat 2580 
gttgcgattg atctgcaaaa atactgctag agtaatgaaa aatataatct attgctgaaa 2640 
ttatctcaga tgttaagatt ttcttaaagt aaattctttc aaattttagc taaaagtctt 2700 
gtaataacta aagaataata cacaatctcg accacggaaa aaaaacacat aataaatttg 2760 
aattagcttg catgcctgca ggtcactgga ttttggtttt aggaattaga aattttattg 2820 
atagaagtat tttacaaata caaatacata ctaagggttt cttatatgct caacacatga 2880 
gcgaaaccct ataagaaccc taattccctt atctgggaac tactcacaca ttattctgga 2940 
gaaaaataga gagagataga tttgtagaga gagactggtg atttttgcgc cgggtaccga 3 000 
gctcggtagc aattcccgag gctgtagccg acgatggtgc gccaggagag ttgttgattc 3 060 
attgtttgcc tccctgctgc ggtttttcac cgaagttcat gccagtccag cgtttttgca 312 0 
gcagaaaagc cgccgacttc ggtttgcggt cgcgagtgaa gatccctttc ttgttaccgc 3180 
caacgcgcaa tatgccttgc gaggtcgcaa aatcggcgaa attccatacc tgttcaccga 3240 
cgacggcgct gacgcgatca aagacgcggt gatacatatc cagccatgca cactgatact 3300 
cttcactcca catgtcggtg tacattgagt gcagcccggc taacgtatcc acgccgtatt 3360 
cggtgatgat aatcggctga tgcagtttct cctgccaggc cagaagttct ttttccagta 3420 
ccttctctgc cgtttccaaa tcgccgcttt ggacatacca tccgtaataa cggttcaggc 3480 
acagcacatc aaagagatcg ctgatggtat cggtgtgagc gtcgcagaac attacattga 3540 
cgcaggtgat cggacgcgtc gggtcgagtt tacgcgttgc ttccgccagt ggcgaaatat 3600 
tcccgtgcac ttgcggacgg gtatccggtt cgttggcaat actccacatc accacgcttg 3660 
ggtggttttt gtcacgcgct atcagctctt taatcgcctg taagtgcgct tgctgagttt 3720 
ccccgttgac tgcctcttcg ctgtacagtt ctttcggctt gttgcccgct tcgaaaccaa 3780 
tgcctaaaga gaggttaaag ccgacagcag cagtttcatc aatcaccacg atgccatgtt 3840 
catctgccca gtcgagcatc tcttcagcgt aagggtaatg cgaggtacgg taggagttgg 3900 
ccccaatcca gtccattaat gcgtggtcgt gcaccatcag cacgttatcg aatcctttgc 3960 
cacgtaagtc cgcatcttca tgacgaccaa agccagtaaa gtagaacggt ttgtggttaa 4020 
tcaggaactg ttcgcccttc actgccactg accggatgcc gacgcgaagc gggtagatat 4080 
cacactctgt ctggcttttg gctgtgacgc acagttcata gagataacct tcacccggtt 4140 
gccagaggtg cggattcacc acttgcaaag tcccgctagt gccttgtcca gttgcaacca 4200 
cctgttgatc cgcatcacgc agttcaacgc tgacatcacc attggccacc acctgccagt 4260 
caacagacgc gtggttacag tcttgcgcga catgcgtcac cacggtgata tcgtccaccc 4320 
aggtgttcgg cgtggtgtag agcattacgc tgcgatggat tccggcatag ttaaagaaat 4380 



14 



0- 



catggaagta 

tagtctgcca 

tggtcatata 

aattgctata 

taattaatga 

tcccggcaat 

ccatcacttc 

gcacgatacg 

tgcccgcata 

caattgcccg 

cgcgatccag 

ctacaggacg 

atagagagag 

gaggaagggt 

tcacatcaat 

ctcgtgggtg 

tttcctttat 

atgaagtgac 

tgaaaagtct 

atccatggag 



agactgcttt 

gttcagttcg 

ttagaaaagt 

tactacattc 

taattattat 

aacatacggc 

ctgattattg 

ctggcctgcc 

attacgaata 

gctttcttgt 

actgaatgcc 

gaccatgnnc 

actggtgatt 

cttgcgaagg 

ccacttgctt 

ggggtccatc 

cgcaatgatg 
agatagctgg 
caatannnng 
ccatttacaa 



ttcttgccgt 

ttgttcacac 

tataaattaa 

ttttattttg 

atatatatca 

gtgacatcgg 

acccacactt 

caacctttcg 

tctgcatcgg 

aacgcgcttt 

cacaggccgt 

ccggggatcc 

tcagcgtgtc 

atagtgggat 

tgaagacgtg 

tttgggacca 

gcatttgtag 

gcaatggaat 

tcgacctgca 

ttgaatatat 



tttcgtcggt 

aaacggtgat 

aatatacaca 

aaaaaaatat 

aaggtagaag 

cttcaaatgg 

tgccgtaatg 

gtataaagac 

cgaactgatc 

cccaccaacg 

cgagtttttt 

tctaganntt 

ctctccaaat 

tgtgcgtcat 

gttggaacgt 

ctgtcggcag 

gagccacctt 

ccgaggaggt 

ggcatgcaag 

cctgccgccg 



aatcaccatt 

acctgcacat 

cttataaact 

ttgaaatatt 

cagaaactta 

cgtatagccg 

agtgaccgca 

ttcgcgctga 

gttaaaactg 

ctgatcaatt 

gatttcacgg 

atagagagag 

gaaatgaact 

cccttacgtc 

cttctttttc 

aggcatcttg 

ccttttctac 

ttcccgaaat 

ctaattccgg 



cccggcggga 4440 
caccatgttt 4500 
acagaaaagc 4560 
atattactac 4620 
cgtacacttt 4680 
ccctgatgct 4740 
tcgaaacgca 4800 
taccagacgt 4860 
cctggcacag 4920 
ccacagtttt 4980 
gttggggttt 5040 
agatagattt 5100 
tccttatata 5160 
agtggagatg 5220 
cacgatgctc 5280 
aatgatagcc 5340 
tgtcctttcg 5400 
tatcctttgt 5460 
ggaagcttag 5520 
5560 
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SEQUENCE LISTING 



) GENERAL INtORMATION: 

(i) APPLIC 

(A) NASE: PLANT GENETIC SYSTEMS N.V. 

(B) STRiET; Plateaustraat 22 

(C) CITYt Ghent 

(E) COUNTRY: Belgium 

(D POSTa\cODE (ZIP): 9000 

(G) TELEPHONE: 32 9 235 84 58 

(H) TELEFAX\ 32 9 223 19 23 

(I) TELEX: lV.361 Pgsgen 

(ii) TITLE OF INVENMON: Genetic Transformation using a PARP inhibitor 
(iii) NUMBER OF SEQUENCES: 5 

(iv) COMPUTER READABLE ^ORM: 

(A) MEDIUM TYPE: KLoppy disk 

(B) COMPUTER: IBM \c compatible 

(C) OPERATING SYSTE^: PC-DOS/MS-DOS 

(D) SOFTWARE: Paten&[n Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: r\j 

(i) SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 4 94 6 base paXrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: T-DNA of plasmid pTJ^lO? 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: complement (1..25) 
(D) OTHER INFORMATION: /label= RB 

/note* -T-DNA right border" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION rconplement (97.. 330) 
(D) OTHER INFORMATION: /label- 3'g7 

, /note- "3 • untranslated Vegion containing the 

polyadenylation signal of gene 7 of Agrobacterium T-DNA " 
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fix) FEATURE: 

(A) UIAME/KEY: - 

(B) LOCATION: complement (331.. 882) 
(D) qTHER INFORMATION: /label= bar 

rransferase" \ /note= "region coding for phosphinothricin acetyl 

(ix) FEATURE: 

(A) NAMfe/ICEY: - 

(B) LOCATION: complement (883.. 2608) 
(D) OTHER INFORMATION: /labels PSSU 

/Aote= "promoter region of Rubisco small subunit aene of 
^'^bidopsis thali,. " ^ 

(ix) FEATURE: 

(A) NAME/lteY: - 

(B) LOCATION :coirplement (2658.. 3031) 
(D) OTHER FORMATION: /label= 3'nos 



(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION :ci 
(D) OTHER INFOF 

/note= 



:>lement (3032. .3367) 
^TION: /label= barnase 
Region coding for barnase' 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :comp\ement (3368.. 4876) 
(D) OTHER INFORMATION: /label= PTA29 

/note= "proi^oter region of TA29 gene of Nicotiana tabacum' 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :compledfeJff^ 4922. .4 94 6) 
(D) OTHER INFORMAT4ai3i*&el« LB 

/note= -T-DNA ikft border" 



(xi) SEQUENCE DESCRIPTION: SEQ NO: 1: 
AATTACAACG GTATATATCC TGCCAGTACT CGGfcCGTCGA ACTCGGCCGT 
GTCGATAAGA AAAGGCAATT TGTAGATGTT AATTflCCATC TTGAAAGAAA 
ATATTTATTG ATAAAATAAC AAGTCAGGTA TTATAG^CCA AGCAAAAACA 
GATGCAAGTT TAAATTCAGA AATATTTCAA TAACTGaW TATCAGCTGG 
TAGATGAAAG ACTGAGTGCG ATATTATGTG TAATACAtA ATTGATGATA 

agctcatcgg gggatcctag acgcgtgaga tcagatctcgVtgacgggca 



cgagtacatg 
tatagtttaa 

TAAATTTATT 
TACATTGCCG 
TAGCTAGCTT 
GGACCGGACG 



60 
120 
180 
240 
300 
360 
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GGGCGGTACC GGOfiGGCTGA AGTCCAGCTG CCAGAAACCC ACGTCATGCC AGTTCCCGTG 
CTTGAAGCCG GCCGCCCGCA GCATGCCGCG GGGGGCATAT CCGAGCGCCT CGTGCATGCG 
CACGCTCGGG TCGTTGGGCA GCCCGATGAC AGCGACCACG CTCTTGAAGC CCTGTGCCTC 
CAGGGACTTC AGCAGGTGGG TGTAGAGCGT GGAGCCCAGT CCCGTCCGCT GGTGGCGGGG 
GGAGACGTAC ACGGTCGACT CGGCCGTCCA GTCGTAGGCG TTGCGTGCCT TCCAGGGGCC 
CGCGTAGGCG ATGCCGGC&A CCTCGCCGTC CACCTCGGCG ACGAGCCAGG GATAGCGCTC 
CCGCAGACGG ACGAGGTCGT CCGTCCACTC CTGCGGTTCC TGCGGCTCGG TACGGAAGTT 
GACCGTGCTT GTCTCGATGt\aGTGGTTGAC GATGGTGCAG ACCGCCGGCA TGTCCGCCTC 

ggtggcacgg cggatgtcgg fccgggcgtcg ttctgggtcc attgttcttc tttactcttt 
gtgtgactga ggtttggtct aWgctttgg tcatctatat ataatgataa caacaatgag 

AACAAGCTTT GGAGTGATCG GAfeGGTCTAG GATACATGAG ATTCAAGTGG ACTAGGATCT 
ACACCGTTGG ATTTTGAGTG TGCa^TATGTG TGAGGTTAAT TTTACTTGGT AACGGCCACA 
AAGGCCTAAG GAGAGGTGTT GAGACCCTTA TCGGCTTGAA CCGCTGGAAT AATGCCACGT 
GGAAGATAAT TCCATGAATC TTATcWtAT CTATGAGTGA AATTGTGTGA TGGTGGAGTG 
GTGCTTGCTC ATTTTACTTG CCTGGT&GAC TTGGCCCTTT CCTTATGGGG AATTTATATT 
TTACTTACTA TAGAGCTTTC ATACCTnTT TTTACCTTGG ATTTAGTTAA TATATAATGG 
TATGATTCAT GAATAAAAAT GGGAAATT^\T TGAATTTGTA CTGCTAAATG CATAAGATTA 
GGTGAAACTG TGGAATATAT ATTTTTTTcA TTTAAAAGCA AAATTTGCCT TTTACTAGAA 
TTATAAATAT AGAAAAATAT ATAACATTO^^TAAAAATG AAAATAAGAA CTTTCAAAAA 
ACAGAACTAT GTTTAATGTG TAAAGATTAGTfiGCACATCA AGTCATCTGT TACAATATGT 
TACAACAAGT CATAAGCCCA ACAAAGTTAG CAGGTCTAAA TAAACTAAAG AGTCCACGAA 
AATATTACAA ATCATAAGCC CAACAAAGTT ATTdATCAAA MAPJ\AMAC GCCCAACAAA 
GCTAAACAAA GTCCAAAAAA AACTTCTCAA GTCTcWcT TCCTTTATGA ACATTGAAAA 
CTATACACAA AACAAGTCAG ATAAATCTCT TTCTGGQCCT GTCTTCCCAA CCTCCTACAT 
CACTTCCCTA TCGGATTGAA TGTTTTACTT GTACCTTOTC CGTTGCAATG ATATTGATAG 
TATGTTTGTG AAAACTAATA GGGTTAACAA TCGAAGTCaV GGAATATGGA TTTGGTCCAA 
GATTTTCCGA GAGCTTTCTA GTAGAAAGCC CATCACCAGA WtTACTAG TAAAATAAAT 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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CACCAATTAG GTTTCTTATT yVTGTGCCAAA TTCAATATAA TTATAGAGGA TAT.TTCAAAT 
GAAAACGTAT GAATGTTATT AGTAAATGGT CAGGTAAGAC ATTAAAAAAA TCCTACGTCA 
GATATTCAAC TTTAAAAATT CGATCAGTGT GGAATTGTAC AAAAATTTGG GATCTACTAT 
ATATATATAA TGCTTTACAA cAcTTGGATT TTTTTTTGGA GGCTGGAATT TTTAATCTAC 
ATATTTGTTT TGGCCATGCA CckACTCATT GTTTAGTGTA ATACTTTGAT TTTGTCAAAT 
ATATGTGTTC GTGTATATTT GT^^AAGAAT TTCTTTGACC ATATACACAC ACACATATAT 
ATATATATAT ATATATTATA TATOATGCAC TTTTAATTGA AAAAATAATA TATATATATA 
TAGTGCATTT TTTCTAACAA CCATkTATGT TGCGATTGAT CTGCAAAAAT ACTGCTAGAG 
TAATGAAAAA TATAATCTAT TGCTcWvTT ATCTCAGATG TTAAGATTTT CTTAAAGTAA 
ATTCTTTCAA ATTTTAGCTA AAAGtAtTGT AATAACTAAA GAATAATACA CAATCTCGAC 
CACGGAAAAA AAACACATAA TAAATTTGAA TTTCGACCGC GGTACCCGGA ATTCGAGCTC* 
GGTACCCGGG GATCTTCCCG ATCTACmAC ATAGATGACA CCGCGCGCGA TAATTTATCC 
TAGTTTGCGC GCTATATTTT GTTTTCTATC GCGTATTAAA TGTATAATTG CGGGACTCTA 
ATCATAAAAA CCCATCTCAT AAATAACG^C ATGCATTAC7V TGTTAATTAT TACATGCTTA 
ACGTAATTCA ACAGAAATTA TATGATAAT^ ATCGCAAGAC CGGCAACAGG ATTCAATCTT 
AAGAAACTTT ATTGCCAAAT GTTTGAACGA\ TCTGCTTCGG ATCCTCTAGA GCCGGAAAGT 
GAAATTGACC GATCAGAGTT TGAAGAAAAA yTTATTACAC ACTTTATGTA AAGCTGAAAA 
AAACGGCCTC CGCAGGAAGC CGTTTTTTTC dXTATCTGAT TTTTGTAAAG GTCTGATAAT 
GGTCCGTTGT TTTGTAAATC AGCCAGTCGC T^GAGTAAAG AATCCGGTCT GAATTTCTGA 
AGCCTGATGT ATAGTTAATA TCCGCTTCAC ^^GTJCG TCCGCTTTTG CCCGGGAGTT 
TGCCTTCCCT GTTTGAGAAG ATGTCTCCGC ^cS^TGCTTTT CCCCGGAGCG ACGTCTGCAA 
GGTTCCCTTT TGATGCCACC CAGCCGAGGG CTTgVgCTTC TGATTTTGTA ATGTAATTAT 
CAGGTAGCTT ATGATATGTC TGAAGATAAT CCGCAl^CCCC GTGAAACGTG TTGATAACCG 
GTACCATGGT AGCTAATTTC TTTAAGTAAA AACTTTIgATT TGAGTGATGA TGTTGTACTG 
TTACACTTGC ACCACAAGGG CATATATAGA GCACAAckcA TACACAACAA CTTGCAAAAC 
TAACTTTTGT TGGAGCATTT CGAGGAAAAT GGGGAGtAsC AGGCTAATCT GAGGGTAACA 
TTAAGGTTTC ATGTATTAAT TTGTTGCAAA CATGGACxk GTGTGAGGAA AAAGTACCAA 
AATTTTGTCT CACCCTGATT TCAGTTATGG AAATTACATt ATGAAGCTGT GCTAGAGAAG 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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ATGTTTATTC TAGTCcXgCC ACCCACCTTA TGCAAGTCTG CTTTTAGCTT GATTCAAAAA 
CTGATTTAAT TTACATTGCT AAATGTGCAT ACTTCGAGCC TATGTCGCTT TAATTCGAGT 
AGGATGTATA TATTAGTAQA TAAAAAATCA TGTTTGAATC ATCTTTCATA AAGTGACAAG 
TCAATTGTCC CTTCTTGTTTy GGCACTATAT TCAATCTGTT AATGCAAATT ATCCAGTTAT 
ACTTAGCTAG ATATCCAATT YtGAATAAAA ATAGCTCTTG ATTAGTAAAC CGGATAGTGA 
CAAAGTCACA TATCCATCAA AtTTCTGGTG CTCGTGGCTA AGTTCTGATC GACATGGGGT 
TAAAATTTAA ATTGGGACAC ATOAATAGCC TATTTGTGCA AATCTCCCCA TCGAAAATGA 
CAGATTGTTA CATGGAAAAC AArfAAGTCCT CTGATAGAAG TCGCAAAGTA TCACAATTTT 
CTATCGAGAG ATAGATTGAA AGA/^TGCAG GGAAGCGGTT AACTGGAACA TAACACAATG 
TCTAAATTAA TTGCATTCGC TAACC^^AAAA GTGTATTACT CTCTCCGGTC CACAATAAGT 
TATTTTTTGG CCCTTTTTTT ATCGTC^AAA ATAAGTGAGT TTTTTAGATT TCAAAAATGA 
TTTAATTATT TTTTTACTAC AGTGCCCirTG GAGTAAATGG TGTTGGAGTA TGTGTTAGAA 
ATGTTTATGT GAAGAAATAG TAAAGGTTAA TATGATCAAT TTCATTGCTA TTTAATGTTA 
AAATGTGAAT TTCTTAATCT GTGTGAAAaA AACCAAAAAA TCACTTATTG TGGACCGGAG 
AAAGTATATA AATATATATT TGGAAGCGAC VtAAAAATAAA CTTTTCTCAT ATTATACGAA 
CCTAAAAACA GCATATGGTA GTTTCTAGGG ^(VTCTAAATC ACTAAAATTA ATAAAAGAAG 
CAACAAGTAT CAATACATAT GATTTACACC GfCAAACACG AAATTCGTAA ATATTTAATA 
TAATAAAGAA TTAATCCAAA TAGCCTCCCA CC^TAACT TAAACTAAAA ATAACCAGCG 
AATGTATATT ATATGCATAA TTTATATATT ^^CS»STfit AATCAT6TAT AATCAATGTA 
TAATCTATGT ATATGGTTAG AAAAAGTAAA CAAxiftATAT AGCCGGCTAT TTGTGTAAAA 
ATCCCTAATA TAATCGCGAC GGATCCCCGG GAATTd^GGG GAAGCTTAGA TCCATGGAGC 
CATTTACAAT TGAATATATC CTGCCG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



372C 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4946 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SEHSL: NO " 

(vi) ORIGINAl\ SOURCE: 

(A) ORGflNISM: plasmid pTS172 

(ix) FEATURE: 

(A) NAME/kEY: - 

(B) LOCATi^N: complement (2019.. 2288) 
(D) OTHER \:NFORMATION:/label= 3'nos 

/note== "3' untranslated region containing the 
polyadenylation signal of the nopaline synthase gene of Agrobacterium 
T-DNA" ^ 

(ix) FEATURE: 

(A) NAME/KEY:\- 

(B) LOCATION rdomplement (2289.-2624) 
(D) OTHER INFORMATION: /label* barnase 

/note= •Wegion coding for barnase" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :comi3aement (2625,. 4313) 
(D) OTHER INFORMATION: /label= PEl 

/note= "prdpoter region of El gene of rice' 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 4336. .sVlO 
(D) OTHER INFORMATION: /labels P35S 

/note= "35S prl^moter region of Cauliflower mosaic virus" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 5711. .6262 
(D) OTHER INFORMATION: /f 

/note= "rei 

transferase" 



ibel-s bar 

lion coding for phosphinothricin acetyl 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 6263. ,64 96 
(D) OTHER INFORMATION: /label\ 3'g7 

, ^ , . /note- "3' \untranslated region containina the 

polyadenyiatxon signal of gene 7 of Agrobacterium T-DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:\2: 
AATTCAAGCT TGACGTCAGG TGGCACTTTT CGGGGAAATl TGCGCGGAAC CCCTATTTGT 
TTATTTTTCT AAATACATTC AAATATGTAT CCGCTCATGA WcAATAACC CTGATAAATG 
CTTCAATAAT ATTGAAAAAG GAAGAGTATG AGTATTCAAC AtTCCGTGT CGCCCTTATT 



60 
120 
180 
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CCCTTTTTTG CGGCAttTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA 
AAAGATGCTG AAGATcVgTT GGGTGCACGA GTGGGTTACA TCGAACTGGA TCTCAACAGC 
GGTAAGATCC TTGAGAGT^T TCGCCCCGAA GAACGTTTTC CAATGATGAG cacttttaaa 
GTTCTGCTAT GTGGCGCGGIV ATTATCCCGT ATTGACGCCG GGCAAGAGCA ACTCGGTCGC 
CGCATACACT ATTCTCAGAA TGACTTGGTT GAGTACTCAC CAGTCACAGA AAAGCATCTT 
ACGGATGGCA TGACAGTAAG ACtAATTATGC AGTGCTGCCA TAACCATGAG TGATAACACT 
GCGGCCAACT TACTTCTGAC AAQGATCGGA GGACCGAAGG AGCTAACCGC TTTTTTGCAC 
AACATGGGGG ATCATGTAAC TCGC^ipTTGAT CGTTGGGAAC CGGAGCTGAA TGAAGCCATA 
CCAAACGACG AGCGTGACAC CACG^^TGCCT GTAGCAATGG CAACAACGTT GCGCAAACTA 
TTAACTGGCG AACTACTTAC TCTAG(3rTCC CGGCAACAAT TAATAGACTG GATGGAGGCG 
GATAAAGTTG CAGGACCACT TCTGCGOTCG GCCCTTCCGG CTGGCTGGTT TATTGCTGAT 
AAATCTGGAG CCGGTGAGCG TGGGTCTC?eC GGTATCATTG CAGCACTGGG GCCAGATGGT 
AAGCCCTCCC GTATCGTAGT TATCTACACfe ACGGGGAGTC AGGCAACTAT GGATGAACGA 
AATAGACAGA TCGCTGAGAT AGGTGCCTCa\ CTGATTAAGC ATTGGTAACT GTCAGACCAA 
GTTTACTCAT ATATACTTTA GATTGATTTA ^VAACTTCATT TTTAATTTAA AAGGATCTAG 
GTGAAGATCC TTTTTGGCTC GAGTCTCATG A^CAAAATCC CTTAACGTGA GTTTTCGTTC 
CACTGAGCGT CAGACCCCGT AGAAAAGATC AAApGATCTT CTTGAGATCC TTTTTTTCTG 
CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCAClfGCTAC CAGCGGTGGT TTGTTTGCCG 
GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAAC^GGCT TCAGCAGAGC GCAGATACCA 
AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG 
CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTC )CTGCCAGTGG CGATAAGTCG 
TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGAT^f^GCGCAGCG GTCGGGCTGA 
ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGACCTACACCGA ACTGAGATAC 
CTACAGCGTG AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT 
CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC 
TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAapGTCG ATTTTTGTGA 
TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGJSCTT TTTACGGTTC 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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1440 
1500 
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1620 
1680 
1740 
1800 
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CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG 1860 
GATAACCGTA TTACCGCCtV TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG 1920 
CGCAGCGAGT CAGTGAGCGA NgGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC 1980 
GCGCGTTGGC CTGATCAGAA t\cATATGCA CGTGTTCCCG ATCTAGTAAC ATAGATGACA 204 0 
CCGCGCGCGPi TAATTTATCC TA<XtTTGCGC GCTATATTTT GTTTTCTATC GCGTATTAAA 2100 
TGTATAATTG CGGGACTCTA ATCA\rAAAAA CCCATCTCAT AAATAACGTC ATGCATTACA 2160 
TGTTAATTAT TACATGCTTA ACGTAATTCA ACAGAAATTA TATGATAATC ATCGCAAGAC 2220 

CGGCAACAGG ATTCAATCTT AAGAAAOTTT ATTGCCAAAT GTTTGAACGA TCTGCTTCGG 2280 

AGGTTACCTT ATCTGATTTT TGTAAAGCTC TGATAATGGT CCGTTGTTTT GTAAATCAGC 234 0 

CAGTCGCTTG AGTAAAGAAT CCGGTCTGaX TTTCTGAAGC CTGATGTATA GTTAATATCC 2400 

GCTTCACGCC ATGTTCGTCC GCTTTTGCCC \gGGAGTTTGC CTTCCCTGTT TGAGAAGATG 24 60 

TCTCCGCCGA TGCTTTTCCC CGGAGCGACG TCTGCAAGGT TCCCTTTTGA TGCCACCCAG 2520 

CCGAGGGCTT GTGCTTCTGA TTTTGTAATG T«ATTATCAG GTAGCTTATG ATATGTCTGA 2580 

AGATAATCCG CAACCCCGTC AAACGTGTTG ATWACCGGTA CCATCGCGAC GGCTTGATGG 264 0 

ATCTCTTGCT GGACACCGGG ATGCTAGGAT GGGT^ATCGT GGCCGGCGTG CGTGTGTGGC 2700 

TTTTGTAGGC GCCGGCGACG GCGGGGGCAA TGTGCtAGGT GAGTCACGGT GCAAGCGTGC 2760 

GCAAGTGACT GCAACAACCA AGGACGGTCA TGGCGW^AGC ACCTCACGCG TCCACCGTCT 2820 

ACAGGATGTA GCAGTAGCAC GGTGAAAGAA GTGTTGT^CC GTCCATTAGG TGCATTCTCA 2880 

CCGTTGGCCA GAACAGGACC GTTCAACAGT TAGGTTGA^T GTAGGACTTT TACGTGGTTA 294 0 

ATGTATGGCA AATAGTAGTA AATTTTGCCC CCATTGGTC^ GGCTGAGATA GAACATATTC 3000 

TGGAAAGCCT CTAGCATATC TTTTTTGACA GCTAAACTTT VsCTTCTTGCC TTCTTGGTCT 3060 

AGCAATGACG TTGCCCATGT CGTGGCAAAC ATCTGGTAAG Qfl^CTGTAT TCGTTTGTTC 3120 

CCTTCAACGG CTCAATCCCC ACAGGCCAAG CTATCCTTTC |c^?GGCAGTA TAGGCTCCTT 3180 

GAGAGATTAT ACTACCATTT TTAAGTGCTT ATAAAGACGA TG^TCTCTAA CCAGATCGAT 3240 

CAGAAACACA AAGTTTTAGC AGCGTAATAT CCCACACACA TAcl^CACACG AAGCTATGCC 3300 

TCCTCATTTT CCGAGAGATT CTGACAGTGA CCAGAATGTC AGAaVgCCAT TTCATGGGCA 3360 

CAAGTCGATC CACAAGCTTC TTGGTGGAGG TCAAGGTGTG CTATT^TTAT TCGCTTTCTA 3420 

GGAAATTATT CAGAATTAGT GCCTTTTATC ATAACTTCTC TCTGAG^CGA TGTGGTTTTG 3480 
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GATTTCATTG 


TTGGGAGCTA 


tTGCAGTiGCtj 


QjAxAI Iwiw^ 


TGTGGAAGAA 
1 u 1 UunnUnn 


PAGGAAPTTA 




TCTGCGGGGG 


TCCTTGCTGG 


iiyUAAUAl lo 


aTaTftftTTPP 

Al Al uul 1 WW 


TGTTPGATGT 

X U X X WU^^X U X 


AGTAGAATAC 


360C 


AATATAATTC 




^P'nPaTTPPP 


aTTPTTf^PPA 
Al 1 L« 1 1 uWWn 


TGPTTGTGAT 
X UW X X u X unx 


PTT G ATTTGG 
w X X w.^^ XXX uu 


■mJ \J \J \J 


T*/^ 7\ A TV T^rr^r^r" 


PnPPIiP'T'^TT 


pp a ^a/t/^T a T 

ouA^AVsVj 1 A 1 


TAftPTTTATT 
InuWl 1 Ini 1 


TPPTGTGGAG 
X WW X u X uunu 


ATGGTAGAAA 


3720 


AL- 1 uAVjL. i iA 


pa/^aaaTrtftp 

L^AoAAAl ouU^ 


ATTTrAPf^TA 
nl 1 IvL/lWulri 


HTATAACGCA 
0 mxrv%wuwn 


AGACATTAGG 


TACTAAAACT 


3780 




TTTPPftaaTT 
111 Lr^wAnx 1 


TPAf^f^ffiPPPP 


TCCAAGGAT C 


CCAGAAATCA 


TCATCTCTGA 


3840 




arzanAZXATf^f; 

noi^OnnnX ww 


PATTKACPf^T 


CPATTAPAAA 


CTAACGTACA 


CTGTATCTGT 


3900 


TPTTTAPf^AP 


ATTfSPATGTG 


GAAAGGATCT 


GAAGAGATTT 


CTCCTGGTAC 


ATAATAATCT 


3960 


aPTPPTTTftP 


TAPf^TTAATA 


Af^Af^ATGTIVA 


AAACATGCAA 


CAGTTCCAGT 

W^lU X X WW^IU X 


GCCAACATTG 

UWW<^<^WX^X X u 


4020 


Tppa an^^aTT 


fZT^PAATTPT 

0 1 IjW/VlX X W X 


TTPTfZ Af^fTn 
1 1 W 1 uunowu 


PTAAAATTGA 
wx'vwix X un 


CPAGATTAGA 
wwxiuf^x x^iun 


CGCATPAGAA 

WUWf^X W^lU^^f^ 


4080 


Iril 1 uxiril i u 


PAf^AftTTA^^P 
V...>AuAu 1 1 now 


PAATAATPPTA 
wnnlrlril Ww 1 1 


P AT A ATGT T A 
Wsnxnnxux xn 


ATGTGCTATT 
nxuxuwxnx x 


GTTGTTGAPT 

U X X U X X WflW X 


414 0 

*t -X H \J 


AL* 1 Crlrt. 1 nl >i 


PTTPT/^i^aPT 
0 1 1 w 1 uotnL* 1 


a APA ATPAI^A 


Vttgtttatga 

u lux X inl un 


TATTAAGGTG 
mix nnuu X u 


CTTGGATPTP 
u X 1 uunx w X w 


4 ?00 


1 Ai 1 L?o i Al 1 


"P r'P a TT 


r:/^a ar:TTPTT 

uuAAu 1 1 1 1 


rfpa^PTT^taP 
ujWrlVjW 1 1 uriW 


AAflTPTAPTA 
rviu 1 W 1 nW 1 n 


TATATTGGTA 
imm 1 uu m 


4 0 60 


(^(^TZiTTPPa/^ 
uolAl 1 VvOlVj 


AT A A AT ATT A 
Alrvvil nl lA 


a a TTTT A A T A 


AaAPAATPAP 

/VV^WXIX^X WnW 


APAGAAGGAT 
nwnuxviuum x 


W X uwuuwwuw 


4 '^70 


TappPTaci^p 




AAAAATPTfZA 
AAAArll W 1 un 


GP'nTAAPAGP 
UW lU AnWnUW 


APAGTTGPTP 
nwnu 1 1 UW 1 w 


PTPTPAGAGP 
w 1 w 1 wnunuw 


4 "^fiO 




aTTPaapapp 

All Vk-AALfAWVh^ 


PTPATATPAA 


PTAnTAPGTT 
W XnWu rlWU 1 1 


GTGTATAAPG 
u 1 u Inl/viwu 


GTPPAPATGP 
U 1 WWnWnl UW 




pppTaTaTap 

V^owl AlAl 


paTpapTppp 

uA 1 uA\k« 1 uuo 


PTTPTAPA A A 
ul lulAwAAA 


fZfZr*ClfZt* A A P A 


AAPfZ^PGTTP 
An^UuWu 1 1 W 


PPfZfZ A f^TT PP 
^Vh^uuMu 1 1 uL^ 




apapaapaaa 

ACAv-AAwlAA 


TTT^ppapTa 

X 1 IVSUV^AUIA 


TTAPA^A^ftP 


AAf^A(7PA(^PA 
AAuAu V.^/iuV.^ 


GPTGAPGPGT 
u^ 1 uAwowu I 


APAPAAPAaP 
AL*AWAAwAAu 


A 


Tpappaaapa 

1 CA^L-AAAL-A 


fiapaprrTT/^a 

uALfAVaul luA 


aPTTPATPPP 
AW 1 1 VmAI wL-W 


PAAArZGAixAA 
LhAAAu unUfcnn 


GPTPAAPTPA 

U^^ 1 LrAAW 1 C/i 


a^pppaaftap 

Au W W ^ AAu Au 


A COf\ 


^fp rp m TV a r* 
Ul 1 I0L.IAA0 


WW 1 AAuAA 


uuwvJAwUAAA 


ppa a a a apnp 
uwAAAAAuUw 


PaPTPPPTPa 
wAw 1 uu w 1 LrA 


Uuw 1 AuuAAw 


4 DoU 


pa a a aft/^ppp 

^MAAAouU>L«w 


arrpapTfiaTP 

AoL>Aw 1 uA 1 W 


pa^ppppAaa 

L# Au L> W L> WAAA 


Af^AGATPTpA 
nuAuAl L» 1 ^w\ 


TTTGPPPPGG 
1 1 luV-^^V^uu 


artaTTapaaT 

AuAl IAVh^AAI 




GGACGATTTC 


CTCTATCTTT 


ACGATCTAGG 


AAGGAAGTTc/ 


Waggtgaag 


GTGACGACAC 


4800 


TATGTTCACC 


ACTGATAATG 


AGAAGGTTAG 


CCTCTTCAM^ 


;roCAGAAAGA 


ATGCTGACCC 


4860 


ACAGATGGTT 


AGAGAGGCCT 


ACGCAGCAGG 


TCTCATCAAG 


AG3ATCTACC 


CGAGTAACAA 


4920 


TCTCCAGGAG 


ATCAAATACC 


TTCCCAAGAA 


GGTTAAAGAT 


GCAGTCAAAA 


GATTCAGGAC 


4980 


TAATTGCATC 


AAGAACACAG 


AGAAAGACAT 


ATTTCTCAAG 


ATCAGAAGTA 


CTATTCCAGT 


5040 


ATGGACGATT 


CAAGGCTTGC 


TTCATAAACC 


AAGGCAAGTA 


ATAOAGATTG 


GAGTCTCTAA 


5100 
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AAAGGTAGTT CCTACTGAAT frTAAGGCCAT GCATGGAGTC TAAGATTCAA ATCGAGGATC 5160 

TAACAGAACT CGCCGTGAAG ACTGGCGAAC- AGTTCATACA GAGTCTTTTA CGACTCAATG 5220 

ACAAGAAGAA AATCTTCGTC AAC^TCGTCG AGCACGACAC TCTGGTCTAC TCCAAAAATG 5280 

TCAAAGATAC AGTCTCAGAA GACcAaAGGG CTATTGAGAC TTTTCAACAA AGGATAATTT 534 0 

CGGGAAACCT CCTCGGATTC CATTGOCCAG CTATCTGTCA CTTCATCGAA AGGACAGTAG 54 00 

AAAAGGAAGG TGGCTCCTAC AAATGCckxC ATTGCGATAA AGGAAAGGCT ATCATTCAAG 54 60 

ATGCCTCTGC CGACAGTGGT CCCAAAGATG GACCCCCACC CACGAGGAGC ATCGTGGAAA 5520 

AAGAAGACGT TCCAACCACG TCTTCAAAGC AAGTGGATTG ATGTGACATC TCCACTGACG 5580 

TAAGGGATGA CGCACAATCC CACTATCCTt\ CGCAAGACCC TTCCTCTATA TAAGGAAGTT 5640 

CATTTCATTT GGAGAGGACA CGCTGAAATC ACCAGTCTCT CTCTATAAAT CTATCTCTCT 5700 

CTCTATAACC ATGGACCCAG AACGACGCCC GtCCGACATC CGCCGTGCCA CCGAGGCGGA ' 5760 

CATGCCGGCG GTCTGCACCA TCGTCAACCA CTACATCGAG ACAAGCACGG TCAACTTCCG 5820 

TACCGAGCCG CAGGAACCGC AGGAGTGGAC GGflCGACCTC GTCCGTCTGC GGGAGCGCTA 5880 

TCCCTGGCTC GTCGCCGAGG TGGACGGCGA GGTOGCCGGC ATCGCCTACG CGGGCCCCTG 5940 

GAAGGCACGC AACGCCTACG ACTGGACGGC CGAGTCGACC GTGTACGTCT CCCCCCGCCA 6000 

CCAGCGGACG GGACTGGGCT CCACGCTCTA CACCCflCCTG CTGAAGTCCC TGGAGGCACA 6060 

GGGCTTCAAG AGCGTGGTCG CTGTCATCGG GCTGCCaAAC GACCCGAGCG TGCGCATGCA 6120 

CGAGGCGCTC GGATATGCCC CCCGCGGCAT GCTGCGGGCG GCCGGCTTCA AGCACGGGAA 6180 

CTGGCATGAC GTGGGTTTCT GGCAGCTGGA CTTCAGCCTG CCGGTACCGC CCCGTCCGGT 624 0 

CCTGCCCGTC ACCGAGATCT GAGATCACGC GTTCTAGGAt\cCCCCGATGA GCTAAGCTAG 6300 

CTATATCATC AATTTATGTA TTACACATAA TATCGCACTC AGTCTTTCAT CTACGGCAAT 6360 

GTACCAGCTG ATATAATCAG TTATTGAAAT ATTTCTGAAT Tff^CTTGC ATCAATAAAT 6420 

TTATGTTTTT GCTTGGACTA TAATACCTGA CTTGTTATTT *A?C^ TATTTAAACT 6480 

ATATTTCTTT CAAGATGGGA ATTAACATCT ACAAATTGCC TTTtfcTTATC GACCATGTAC 6540 

GTATCGCG \ ^^^g 
(2) INFORMATION FOR SEQ ID NO: 3: \ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 1601 base pairs \ 

(B) TYPE: nucleic acid \ 
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(C) STRANpEDNESS : double 

(D) TOPOLQGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

C iii ) HYPOTHETICAL :\ NO 

(iv) ANTI -SENSE: NQ 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: T72 promoter region 

(ix) FEATURE: \ 

(A) NAME/KEY: -\ 

(B) LOCATION: complement {1..1601) 
(D) OTHER INFORMOTION: /labels PT72 

/note= "pr&moter region of T72 gene of rice" 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 3: 













X w^nwww X 




GTGCGTGTGT 


GTCGTGGTGG 


tggtggcgaV 


ACGCGACGCG 


AGCTCGATTT 


ATAGGAGGGG 


120 


ATCGAAGGAG 


GGGAGCGCGC 


GCGGCGAGGC 


VCCGCGTTGCT 


CACCTACGCC 


GCGCGCATGC 


180 


GGCGGACGCG 


CGGTCGGCGC 


CCGCGCCGGC 


QGGGAGGACG 


AGGGCGCAAG 


CGTGTGAGCC 


240 


ACCGAACGCG 


CGCGCGCGCC 


GCGGCGCGAA 


CTCTCCATCG 


CGTCGCGGCG 


AGCCGAGAGC 


300 


CGACGAGAGC 


GTTTCGCGCG 


cgcggttggg 


CCGiSCGACAA 


GATGGGCCGT 


AGCCCTGGGC 


360 


CTCGTGCCAT 


CTTTTTTTTT 


cttttttgcc 


TTTTTTGGCC 


TGGCAATTTC 


TTTTTGTTTT 


420 


TAGTCTTTTT 


GTGGTGATAA 


tgtgtcgtct 


TCCGGIGAAC 


TAATTTACTC 


GTTGATCTTT 


480 


TTGTGTCCCT 


TCGAATATTC 


gcagtggtag 


AAGATGACTA 


CTACTACCAG 


TAGTTGATCT 


540 


CGAATGGCAA 


CTTTTGTGCA 


gaacttattc 


CACGGCTATG 


TCAGCTTCCA 


CTGTGACTAA 


600 


AAAAACTACG 


GCCATCTTTT 


ggacttgttc 


TATCTTGGAA 


CTGAACAAAA 


AGGACGATCC 


660 


TGATGTACAC 


ACGGCATAGT 


ttccagcact 


ggatgccaag\ 


^GCCAACTG 


TTACCACGAT 


720 


AATGGAACGA 


CGAGATGAGA 


tattatacaa 


GTCCAATGG^ 


■tcaagatcct 


GTGCAGTTGT 


780 


TATTGTAACT 


GTAACTTAAG 


ccgttaacat 


GTACATCACA 


TTTCCTACTC 


TATCAATGTC 


840 


TTGTGCGGGT 


TGTTTCAAAA 


AAACATGTAC 


ATCACATGAT 


CTAGAACGGA 


AGGCCAGGAT 


900 


ATGAAGTGGT 


ACTGCAGCAA 


aaacactgta 


GCAGAGATGT 


actAttatgc 


ATGTACTGTA 


960 


GCAGTCATCT 


AGAGCCGTTG 


GATCTGAAAA 


CGAATGGACA 


TGATTGTGTG 


CAGTTGCTAT 


1020 
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15 



1^5 



30 



40 



45 



50 



55 



TGTGCAGTTA CAATAGCAAC \TGCATTTGAT CTTAATCCAA GTCCAATACA TGCAGAACAG 1080 

TAGCTACGAG CTGGAAAGGA l^CAAATCTG GGTGACACTG ACAGCAACCG TGGAAGAACA 114 0 

ACAGCAGCAA AGTCCCAGAG Gci^TGGCAAT TTGAAGGAAT TTAAATACTC TAATATTACT 1200 

CCACCCGTTA AAAAAAACAA CTtVcTACGC ATAATATATG TTCGGATTTA TAGCGAGAAG 1260 

TTAATTTTTC ATGAGAAGAA GAATJ^TATAT GTAATATGTA CTAGGAGAGT ACTCGCTTCA 1320 

TAAATATAAA TATTCATAAG TTGTcfcAGTG AAGATAGCTT TAGAAAAAAC TAGTTATTTT 1380 

ATTTGTCAAA TTTTAAATTT TGAAGT^GTT AGATTATCTT TCTAGTAGTT CTGATTGGTT 14 4 0 

GAAAATGTTT AGATTTTCAT GTGTTAA^gAG TTCCGTATCC TAAAAATAGT AATATAATTT 1500 

TAAATCATAT ATATATATAT ATATATAT^T ATATATATAT ATATATATAT ATATATATAT 1560 

TGTTGAACGG TTTGTGCTCT GGTTGCTAlt CTGTTCTGTG G 1601 
(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTid 

(A) LENGTH: 6291 base p^irs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : ciouble\ 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomi^) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: plasmid pVE136 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :con^lement (425.^6^7) 
(D) OTHER INFORMATION: /label* 3^05 

/note- "3 'untranslated region containing the 
polyadenylation signal of the nopaline Synthase gene of Agrobacterium 
T-DNA" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: con?)lement (803.-1138)^ 
(D) OTHER INFORMATION: /label= barnai 

/note«= "region coding for bari^ase' 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :conq3lement (1138.. 2317) 
(D) OTHER INFORMATION :/label« PCa55 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



/norf= "stamen-specific promoter from corn gene CA55' 

(ix) FEATURE: 

(A) NAME/KEy\ - 

(B) LOCATION: 2355, .3187 
(D) OTHER INFORMATION: /labels P35S 

/note= \35S promoter region of Cauliflower mosaxc virus" 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 3180^. 3739 
(D) OTHER INFORMATION: /label= bar 

/no\e= "region coding for phosphinotricin acetyl 

transferase" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 3757. .4flLl7 
(D) OTHER INFORMATION\/label= 3'nos 

/note* "3' untranslated region containing the 

polyadenylation \ ^ ' . 

signal of tt^ nopaline synthase gene of Agrobacterium 

T-DNA" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 699. .702 

(D) OTHER INFORMATION: /notW "region with unknown 

sequence (may contain up to 15 nucleotides)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID^O: 4: 
TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGAQACAT 
CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGOpCG 
TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCAT 
ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAG 
ATTCGCCATT CAGGCTGCGC AACTGTTGGG AAGGGCGAT 
TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGAT^ 
TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGAATl 
CTTCCCGATC TAGTAACATA GATGACACCG CGCGCGATAA TT^ 
ATATTTTGTT TTCTATCGCG TATTAAATGT ATAATTGCGG 
ATCTCATAAA TAACGTCATG CATTACATGT TAATTATTAC 
GAAATTATAT GATAATCATC GCAAGACCGG CAACAGGATT 



GAGACGGTCA 
TCAGCGGGTG 
CTGAGAGTGC 
ATCAGGCGCC 
TCTTCGCTAT 
ACGCCAGGGT 
ACCCGGGGAT 
TTTGCGCGCT 
ATAAAAACCC 
TAATTCAACA 
. AAACTTTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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GCCAAATGTT TGAACGATCT GCTrCGGATC CTCTAGAGNN NNCCGGAAAG TGAAATTGAC 
CGATCAGAGT TTGAAGAAAA ATTT&TTACA CACTTTATGT AAAGCTGAAA AAAACGGCCT 
CCGCAGGAAG CCGTTTTTTT CGTTaVcTGA TTTTTGTAAA GGTCTGATAA TGGTCCGTTG 
TTTTGTAAAT CAGCCAGTCG CTTGAOTAAA GAATCCGGTC TGAATTTCTG AAGCCTGATG 
TATAGTTAAT ATCCGCTTCA CGCCATdTTC GTCCGCTTTT GCCCGGGAGT TTGCCTTCCC 
TGTTTGAGAA GATGTCTCCG CCGATGC^TT TCCCCGGAGC GACGTCTGCA AGGTTCCCTT 
TTGATGCCAC CCAGCCGAGG GCTTGTGC^ CTGATTTTGT AATGTAATTA TCAGGTAGCT 
TATGATATGT CTGAAGATAA TCCGCAACcA CGTCAAACGT GTTGATAACC GGTACCATGG 
CTGCAGCTAG TTAGCTCGAT GTATCTTCTG VaTATGCAGT GCAGCTTCTG CGTTTTGGCT 
GCTTTGAGCT GTGAAATCTC GCTTTCCAGT dcCTGCGTGT TTTATAGTGC TGTACGTTCG 

tgatcgtgag caaacagggc gtgcctcaac tActggtttg gttgggtgac aggcgccaac 

TACGTGCTCG TAACCGATCG AGTGAGCGTA AXdCAACATT TTTTCTTCTT CTCTCGCATT 

ggtttcatcc agccaggaga cccgaatcga attWatca CAAATCTGAG gtacagtatt 

TTTACAGTAC C6TTCGTTCG AAGGTCTTCG ACAG^TCAAG GTAACAAAAT CAGTTTTAAA 
TTGTTGTTTC AGATCAAAGA AAATTGAGAT GATClteAAGG ACTTGGACCT TCGTCCAATG 
AAACACTTGG ACTAATTAGA GGTGAATTGA AAGCAAGCAG ATGCAACCGA AGGTGGTGAA 
AGTGGAGTTT CAGCATTGAC GACGAAAACC TTCGAAbGGT ATAAAAAAGA AGCCGCAATT 
AAACGAAGAT TTGCCAAAAA GATGCATCAA CCAAGgAaAG ACGTGCATAC ATGTTTGATG 
AAAACTCGTA AAAACTGAAG TACGATTCCC CATTCCCtTC CTTTTCTCGT TTCTTTTAAC 

tgaagcaaag aatttgtatg tattccctcc attccat;!tt ctaggaggtt ttggcttttc 

ATACCCTCCT CCATTTCAAA TTATTTGTCA TACATTGaAg ATATACACCA TTCTAATTTA 
TACTAAATTA CAGCTTTTAG ATACATATAT TTTATTATa\: ACTTAGATAC GTATTATATA 
AAACACCTAA TTTAAAATAA AAAATTATAT AAAAAGTGTflif^TAAAAAAT CAAAATACGA 
CATAATTTGA AACGGAGGGG TACTACTTAT GCAAACoJjr^GTGGTAACC CTAAACCCTA 
TATGAATGAG GCCATGATTG TAATGCACCG TCTGATTAAC (iRAGATATCA ATGGTCAAAG 
ATATACATGA TACATCCAAG TCACAGCGAA GGCAAATGTG aLaCAGTTT TTTTTACCAG 
AGGGACAAGG GAGAATATCT ATTCAGATGT CAAGTTCCCG TaVcACACTG CCAGGTCCTT 
ACTCCAGACC ATCTTCCGGC TCTATTGATG CATACCAGGA ATtWtCTAG AGTCGACCTG 



■720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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CAGGCATGCA AGCTCCTACG CAGCAGGTCT CATCAAGACG ATCTACCCGA GTAACAATCT 
CCAGGAGATC AAATACCTTC\CCAAGAAGGT TAAAGATGCA GTCAAAAGAT TCAGGACTAA 
TTGCATCAAG AACACAGAGA A^GACATATT TCTCAAGATC AGAAGTACTA TTCCAGTATG 
GACGATTCAA GGCTTGCTTC ATAAACCAAG GCAAGTAATA GAGATTGGAG TCTCTAAAAA 
GGTAGTTCCT ACTGAATCTA AGGCCATGCA TGGAGTCTAA GATTCAAATC GAGGATCTAA 
CAGAACTCGC CGTGAAGACT GGCgAcaGT TCATACAGAG TCTTTTACGA CTCAATGACA 
AGAAGAAAAT CTTCGTC/^C ATGGTGSAGC ACGACACTCT GGTCTACTCC AAAAATGTCA 
AAGATACAGT CTCAGAAGAC CAAAGGGCTA TTGAGACTTT TCAACAAAGG ATAATTTCGG 
GAAACCTCCT CGGATTCCAT TGCCCAGC^A TCTGTCACTT CATCGAAAGG ACAGTAGAAA 
AGGAAGGTGG CTCCTACAAA TGCCATCATil GCGAT7VAAGG AAAGGCTATC ATTCAAGATG 
CCTCTGCCGA CAGTGGTCCC AAAGATGGAC WcCACCCAC GAGGAGCATC GTGGAAAAAG 
AAGACGTTCC AACCACGTCT TCAAAGCAAG TpGATTGATG TGACATCTCC ACTGACGTAA 
GGGATGACGC ACAATCCCAC TATCCTTCGC A^llGACCCTTC CTCTATATAA GGAAGTTCAT 
TTCATTTGGA GAGGACACGC TGAAATCACC AG^CTCTCTC TATAAATCTA TCTCTCTCTC 
TATAACCATG GACCCAGAAC GACGCCCGGC CGA^TCCGC CGTGCCACCG AGGCGGACAT 
GCCGGCGGTC TGCACCATCG TCAACCACTA CATCfiLAGACA AGCACGGTCA ACTTCCGTAC 
CGAGCCGCAG GAACCGCAGG AGTGGACGGA CGACCTCGTC CGTCTGCGGG AGCGCTATCC 
CTGGCTCGTC GCCGAGGTGG ACGGCGAGGT CGCCGG^C GCCTACGCGG GCCCCTGGAA 
GGCACGCAAC GCCTACGACT GGACGGCCGA GTCGACCtfeafe TACGTCTCCC CCCGCCACCA 
GCGGACGGGA CTGGGCTCCA CGCTCTACAC CCAcHSCTG AAGTCCCTGG AGGCACAGGG 
CTTCAAGAGC GTGGTCGCTG TCATCGGGCT GCCCAACGAC CCGAGCGTGC GCATGCACGA 
GGCGCTCGGA TATGCCCCCC GCGGCATGCT GCGGGCGGCc\gGCTTCAAGC ACGGGAACTG 
GCATGACGTG GGTTTCTGGC AGCTGGACTT CAGCCTGCCG CTACCGCCCC GTCCGGTCCT 
GCCCGTCACC GAGATCTGAT CTCACGCGTC TAGGATCCGA AGOAGATCGT TCAAACATTT 

ggcaataaag tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt atcatataat 

TTCTGTTGAA TTACGTTAAG CATGTAATAA TTAACATGTA ATGcAtGACG TTATTTATGA 
GATGGGTTTT TATGATTAGA GTCCCGCAAT TATACATTTA ATACGtGATA GAAAACAAAA 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 
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TATAGCGCGC AAACTAGGAT V^TTATCGC GCGCGGTGTC ATCTATGTTA CTAGATCGGG 
AAGATCCTCT AGAGTCGACC TfccAGGCATG CAAGCTTGGC GTAATCATGG TCATAGCTGT 
TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC GGAAGCATAA 
AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG TTGCGCTCAC 
TGCCCGCTTT CCAGTCGGGA AACCTOTCGT GCCAGCTGCA TTAATGAATC GGCCAACGCG 
CGGGGAGAGG CGGTTTGCGT- ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 
GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 
CCACAGAATC AGGGGATAAC GCAGGAAAck ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 
GGAACCGTAA AAAGGCCGCG TTGCTGGCGTV TTTTCCATAG GCTCCGCCCC CCTGACGAGC 
ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 
AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG ' 
GATACCTGTC CGCCTTTCTC CCTTCGGGAA GQGTGGCGCT TTCTCAATGC TCACGCTGTA 
GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCWVGCTGGG CTGTGTGCAC GAACCCCCCG 
TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTAtCGTCT TGAGTCCAAC CCGGTAAGAC 
ACGACTTATC GCCACTGGCA GCAGCCACTG GTAA^GGAT TAGCAGAGCG AGGTATGTAG 
GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACtACGG CTACACTAGA AGGACAGTAT 
TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGQAAA AAGAGTTGGT AGCTCTTGAT 
CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTT^GT TTGCAAGCAG CAGATTACGC 
GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTT^C TACGGGGTCT GACGCTCAGT 
GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATtATCAAAAAGG ATCTTCACCT 
AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATOfcfiJrf^ GAGTAAACTT 
GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT QTCAGCGATC TGTCTATTTC 
GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC T^GATACGG GAGGGCTTAC 
CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG CTckcCGGCT CCAGATTTAT 
CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGt\cTGCA ACTTTATCCG 
CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTA^TCG CCAGTTAATA 
GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGfiffCG TCGTTTGGTA 
TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 



4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 
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4 920 

4980 

5040 
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5220 
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GGTTAGCTCC .VTTCGGTCCTC 


CGATCGTTGT 


CAGAAGTAAG 


TTGGCCGCAG 




TGTTATCACT 


CATGGTTATG 


GCAGCACTGC 


ATAATTCTCT 


TACTGTCATG 


CCATCCGTAA 




GATGCTTTTC 


TGTGACTGGT 


gm;tactcaa 


CCAAGTCATT 


CTGAGAATAG 


TGTATGCGGC 


5820 


GACCGAGTTG 


CTCTTGCCCG 


GCQTCAATAC 


GGGATAATAC 


CGCGCCACAT 


AGCAGAACTT 


5880 


TAAAAGTGCT 


CATCATTGGA 


AAAQGTTCTT 


CGGGGCGAAA 


ACTCTCAAGG 


ATCTTACCGC 


594 0 


TGTTGAGATC 


CAGTTCGATG 


TAACaCACTC 


GTGCACCCAA 


CTGATCTTCA 


GCATCTTTTA 


6000 


CTTTCACCAG 


CGTTTCTGGG 


TGAGCAAAAA 


CAGGAAGGCA 


AAATGCCGCA 


AAAAAGGGAA 


6060 


TAAGGGCGAC 


ACGGAAATGT 


TGAATACTCA 


TACTCTTCCT 


TTTTCAATAT 


TATTGAAGCA 


6120 


TTTATCAGGG 


TTATTGTCTC 


ATGAGCqGAT 


ACATATTTGA 


ATGTATTTAG 


AAAAATAAAC 


6180 


AAATAGGGGT 


TCCGCGCACA 


TTTCCCCOAA 


AAGTGCCACC 


TGACGTCTAA 


GAAACCATTA 


6240 


TTATCATGAC 


ATTAACCTAT 


AAAAATAG£C 


GTATCACGAG 


GCCCTTTCGT 


C 


6291 


(2) INFORMATION FOR SEQ ID NO: \: 











(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 5560 base j^airs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doubly 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: T-DNA of p3 



Ld PTHW142 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION:!. .25 
(D) OTHER INFORMATION: /label- 

/note* "right border \ sequence of oct opine TL-DNA from 

pTiB6S3" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: complement (84,. 296) 
(D) OTHER INFORMATION: /label= 3'g7 

/note« ••3' untranslated region containing the 
polyadenylation signal of gene 7 of Agrobacterium T-DNA" 



55 



(ix) FEATURE: 
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rransferase" 



(A) NAfiE/KEY: - 

(B) LOCATION: complement (318.. 869) 
(D) OTH^R INFORMATION: /label= bar 

/note^ "region coding for posphinotricin acety 



(ix) 



(ix) 



(ix) 



FEATURE: 

(A) NAME/KftY: - 

(B) LOCATIO^: complement (830. .2760) 
(D) OTHER INFORMATION: /label= PSSU 

/noteV "promoter region of Rubisco small subunit gene of 
Arabic^opsis thaii, 

FEATURE: 

(A) NAME/KEY; 

(B) LOCATION :c<kplement (2765.-3058) 
(D) OTHER INFORMATION: /labels 3 '355 

/note« untranslated region of the CaMV 35S transcript 

containing polyadenylation signals" 

FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION :coir^lfement (3059.. 5056) 
(D) OTHER INFORMATION: /label= uidA" 

/note* "regiipn coding for beta-glucoronidase" 



(ix) 



FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : complem^t (4483.. 4671) 
(D) OTHER INFORMATIONS /label= IV2 

/note= "regiola corresponding to the second intron of the 
ST-LSl gene" ^ 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : complement \( 5067. .5502) 
(D) OTHER INFORMATION: /libel« P35S 

/note= "35S promot\ar^egion of CaMV" 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 5533. .5560^ 
(D) OTHER INFORMATION: /labelV LB 

PTIB6S3" /note- "left border sequence of octopine TL-DNA from 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 5058. .5059 
(D) OTHER INFORMATION: /note- "r€ 

sequence (may contain up 



?ion with unknown 
to 20 nucleotides)" 



(ix) FEATURE: 

(A) NAME/KEY: - 
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(B) LOCATION: 5077. .5078 

(D) OTHER INFORMATION: /note* "region with unknown 

sequence (may contain up to 20 nucleotides)" 

(ix) FEATURE: \ 

(A) NAME/KEY: V- 

(B) LOCATION: 5476. .5479 

(D) OTHER INFORtlATION:/note= "region with unknown 

sequence \ (may contain up to 20 nucleotides)" 



(xi) SEQUENCE DESCRIPTITpN: SEQ ID NO: 5: 



AATTACAACG 


GTATATATCC 


TGCCAQTACT 


CGGCCGTCGA 


gtacatggtc 


GATAAGAAAA 


60 


GGCAATTTGT 


AGATGTTAAT 


TCCCAT(TrTG 


AAAGAAATAT 


agtttaaata 


TTTATTGATA 


120 


AAATAACAAG 


TCAGGTATTA 


TAGTCCAAGC 


AAAAACATAA 


atttattgat 


GCAAGTTTAA 


180 


ATTCAGAAAT 


ATTTCAATAA 


CTGATTATAT 


CAGCTGGTAC 


attgccgtag 


ATGAAAGACT 


240 


GAGTGCGATA 


TTATGTGTAA 


TACATAAATT 


GATGATATAG 


ctagcttagc 


TCATCGGGGG 


300 


ATCCTAGACG 


CGTGAGATCA 


GATCTCGGTG* 


i ACGGGCAGGA 


ccggacgggg 


CGGTACCGGC 


360 


AGGCTGAAGT 


CCAGCTGCCA 


GAAACCCACG 


yCATGCCZAGT 


tcccgtgctt 


GAAGCCGGCC 


420 


GCCCGCAGCA 


TGCCGCGGGG 


GGCATATCCG 


Agcgcctcgt 


gcatgcgcac 


GCTCGGGTCG 


480 


TTGGGCAGCC 


CGATGACAGC 


GACCACGCTC 


TTGAAGCCCT 


GTGCCTCCAG 


GGACTTCAGC 


540 


AGGTGGGTGT 


AGAGCGTGGA 


GCCCAGTCCC 


gtccgctggt 


GGCGGGGGGA 




con 


GTCGACTCGG 


CCGTCCAGTC 


GTAGGCGTTG 


CGTCCCTTCC 


AGGGGCCCGC 


GTAGGCGATG 


660 


CCGGCGACCT 


CGCCGTCCAC 


CTCGGCGACG 


AGCdAGGGAT 


AGCGCTCCCG 


CAGACGGACG 


720 


AGGTCGTCCG 


TCCACTCCTG 


CGGTTCCTGC 


GGCTCGGTAC 


GGAAGTTGAC 


CGTGCTTGTC 


780 


TCGATGTAGT 


GGTTGACGAT 


GGTGCAGACC 


GCCGGCATGT 


CCGCCTCGGT 


GGCACGGCGG 


840 


ATGTCGGCCG 


GGCGTCGTTC 


TGGGTCCATG 


CAGWffltCTp 


TTCCGCCGTT 


GCTTGTGATG 


900 


GAAGTAATGT 


CGTTGTTAGC 


CTTGCGGGTG 


GC^Tq3;S^GG 


CAGCGGAGGA 


CTTAAGTCCG 


960 


TTGAAAGGAG 


CGACCATAGT 


GGCCTGAGCC 


ggagaggW 


CCATAGTAGC 


GGAAGAGAGC 


1020 


ATAGAGGAAG 


CCATTGTTCT 


TCTTTACTCT 


TTGTGTGACT 


GAGGTTTGGT 


CTAGTGCTTT 


1080 


GGTCATCTAT 


ATATAATGAT 


AACAACAATG 


AGAACAAGC^ 


TTGGAGTGAT 


CGGAGGGTCT 


1140 


AGGATACATG 


AGATTCAAGT 


GGACTAGGAT 


ctacaccgtt\ 


3GATTTTGAG 


TGTGCSATATG 


1200 


TGTGAGGTTA ATTTTACTTG 


GTAACGGCCA 


CAAAGGCCTA i 


NGGAGAGGTG 


TTGAGACCCT 


1260 
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TATCGGCTTG AACCGCTGGA ATAATGCCAC GTGGAAGATA ATTCCATGAA TCTTATCGTT 
ATCTATGAGT GAAATTGTGT >GATGGTGGAG TGGTGCTTGC TCATTTTACT TGCCTGGTGG 
ACTTGGCCCT TTCCTTATGG GGAATTTATA TTTTACTTAC TATAGAGCTT TCATACCTTT 
TTTTTACCTT GGATTTAGTT AATATATAAT GGTATGATTC ATGAATAAAA ATGGGAAATT 
TTTGAATTTG TACTGCTAAA TGffiATAAGAT TAGGTGAAAC TGTGGAATAT ATATTTTTTT 
CATTTAAAAG CAAAATTTGC CTTTTACTAG AATTATAAAT ATAGAAAAAT ATATAACATT 
CAAATAAAAA TGAAAATAAG AACtWcAAA AAACAGAACT ATGTTTAATG TGTAAAGATT 
AGTCGCACAT CAAGTCATCT GTTAOAATAT GTTACAACAA GTCATAAGCC CAACAAAGTT 
AGCACGTCTA AATAAACTAA AGAGTQCACG AAAATATTAC AAATCATAAG CCCAACAAAG 
TTATTGATCA AAAAAAAAAA ACGCCCAACA AAGCTAAACA AAGTCCAAAA AAAACTTCTC 
AAGTCTCCAT CTTCCTTTAT GAACATTGAA AACTATACAC AAAACAAGTC AGATAAATCT 
CTTTCTGGGC CTGTCTTCCC AACCTCCTIaC ATCACTTCCC TATCGGATTG AATGTTTTAC 
TTGTACCTTT TCCGTTGCAA TGATATTgAt AGTATGTTTG TGAAAACTAA TAGGGTTAAC 
AATCGAAGTC ATGGAATATG GATTTGGTCd AAGATTTTCC GAGAGCTTTC TAGTAGAAAG 
CCCATCACCA GAAATTTACT AGTAAAATAa\ ATCACCAATT AGGTTTCTTA TTATGTGCCA 
AATTCAATAT AATTATAGAG GATATTTCAA lATGAAAACGT ATGAATGTTA TTAGTAAATG 
GTCAGGTAAG ACATTAAAAA AATCCTACGT Of^GATATTCA ACTTTAAAAA TTCGATCAGT 
GTGGAATTGT ACAAAAATTT GGGATCTACT A^^TATATAT AATGCTTTAC AACACTTGGA 
TTTTTTTTTG GAGGCTGGAA TTTTTAATCT ACOTATTTGT TTTGGCCATG CACCAACTCA 
TTGTTTAGTG TAATACTTTG ATTTTGTCAA ATATATGTGT TCGTGTATAT TTGTATAAGA 
ATTTCTTTGA CCATATACAC ACACACATAT ATATAPA^AT ATATATATTA TATATCATGC 
ACTTTTAATT GAAAAAATAA TATATATATA TATA€TCb«r TTTTTCTAAC AACCATATAT 
GTTGCGATTG ATCTGCAAAA ATACTGCTAG AGT^AftTOftAA AATAtAATCT ATTGCTGAAA 
TTATCTCAGA TGTTAAGATT TTCTTAAAGT AAATTCT'^^C AAATTTTAGC TAAAAGTCTT 
GTAATAACTA AAGAATAATA CACAATCTCG ACCACGGAA)R AAAAACACAT AATAAATTTG 
AATTAGCTTG CATGCCTGCA GGTCACTGGA TTTTGGTTTtXaGGAATTAGA AATTTTATTG 
ATAGAAGTAT TTTACAAATA CAAATACATA CTAAGGGTTT \tTATATGCT CAACACATGA 
GCGAAACCCT ATAAGAACCC TAATTCCCTT ATCTGGGAAC T;teTCACACA TTATTCTGGA 
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GAAAAATAGA GAGAGATA(^ TTTGTAGAGA GAGACTGGTG ATTTTTGCGC CGGGTACCGA 30CC 

GCTCGGTAGC AATTCCCGAgXgCTGTAGCCG ACGATGGTGC GCCAGGAGAG TTGTTGATTC 3060 

ATTGTTTGCC TCCCTGCTGC GGTTTTTCAC CGAAGTTCAT GCCAGTCCAG CGTTTTTGCA 3120 

GCAGAAAAGC CGCCGACTTC GGf TTGCGGT CGCGAGTGAA GATCCCTTTC TTGTTACCGC 3180 

CAACGCGCAA TATGCCTTGC GAGGTCGCAA AATCGGCGAA ATTCCATACC TGTTCACCGA 324 0 

CGACGGCGCT GACGCGATCA AAGACGCGGT GATACATATC CAGCCATGCA CACTGATACT 3300 

CTTCACTCCA CATGTCGGTG TACATTGAGT GCAGCCCGGC TAACGTATCC ACGCCGTATT 3360 

CGGTGATGAT AATCGGCTGA TGCAGTTTCT CCTGCCAGGC CAGAAGTTCT TTTTCCAGTA 3420 

CCTTCTCTGC CGTTTCCAAA TCGCCGCTTT GGACATACCA TCCGTAATAA CGGTTCAGGC 34 80 

ACAGCACATC AAAGAGATCG CTGATGGTAT CGGTGTGAGC GTCGCAGAAC ATTACATTGA 354 0 

CGCAGGTGAT CGGACGCGTC GGGTCGAGtV TACGCGTTGC TTCCGCCAGT GGCGAAATAT 3600 

TCCCGTGCAC TTGCGGACGG GTATCCGGtA CGTTGGCAAT ACTCCACATC ACCACGCTTG 3660 

GGTGGTTTTT GTCACGCGCT ATCAGCTCTT TAATCGCCTG TAAGTGCGCT TGCTGAGTTT 3720 

CCCCGTTGAC TGCCTCTTCG CTGTACAGTT CTTTCGGCTT GTTGCCCGCT TCGAAACCAA 3780 

TGCCTAAAGA GAGGTTAAAG CCGACAGCAG CAGTTTCATC AATCACCACG ATGCCATGTT 384 0 

CATCTGCCCA GTCGAGCATC TCTTCAGCGT AAGGGTAATG CGAGGTACGG TAGGAGTTGG 3900 

CCCCAATCCA GTCCATTAAT GCGTGGTCGT GCACCATCAG CACGTTATCG AATCCTTTGC 3960 

CACGTAAGTC CGCATCTTCA TGACGACCAA AGCCAGTAAA GTAGAACGGT TTGTGGTTAA 4020 

TCAGGAACTG TTCGCCCTTC ACTGCCACTG ACCGGATCCC GACGCGAAGC GGGTAGATAT 4080 

CACACTCTGT CTGGCTTTTG GCTGTGACGC ACAGTTCffiPff) GAGATAACCT TCACCCGGTT 4140 

GCCAGAGGTG CGGATTCACC ACTTGCAAAG TCCCGCTftGT^CCTTGTCCA GTTGCAACCA 4200 

CCTGTTGATC CGCATCACGC AGTTCAACGC TGACATcKccXaTTGGCCACC ACCTGCCAGT 4260 

CAACAGACGC GTGGTTACAG TCTTGCGCGA CATGCGTCAC ^CGGTGATA TCGTCCACCC 4320 

AGGTGTTCGG CGTGGTGTAG AGCATTACGC TGCGATGGAT TCCGGCATAG TTAAAGAAAT 4380 

CATGGAAGTA AGACTGCTTT TTCTTGCCGT TTTCGTCGGT AATCACCATT CCCGGCGGGA 4 440 

TAGTCTGCCA GTTCAGTTCG TTGTTCACAC AAACGGTGAT ACCTGCACAT CACCATGTTT 4500 

TGGTCATATA TTAGAAAAGT TATAAATTAA AATATACACA CTTATAkACT ACAGAAAAGC 4560 
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AATTGCTATA 


TACTACATTC 


T^TTTATTTTG 


AAAAAAATAT 


ttgaaatatt 


ATATTACTAC 


4620 




TAATTAATGA 


TAATTATTAT 


ATATATATCA 


- AAGGTAGAAG 


CAGAAACTTA 


CGTACACTTT 


4680 


5 


TCCCGGCAAT 


AACATACGGC 


GTGAtATCGG 


CTTCAAATGG 


CGTATAGCCG 


CCCTGATGCT 


4740 




CCATCACTTC 


CTGATTATTG 


ACCCAQACTT 


TGCCGTAATG 


AGTGACCGCA 


TCGAAACGCA 


4800 


10 


GCACGATACG 


CTGGCCTGCC 


CAACCTTTCG 


GTATAAAGAC 


TTCGCGCTGA 


TACCAGACGT 


4860 


TGCCCGCATA 


ATTACGAATA 


TCTGCATCGG 


CGAACTGATC 


GTT/^AAACTG 


CCTGGCACAG 


4920 




CAATTGCCCG 


GCTTTCTTGT 


AACGCGCTT1( 


CCCACCAACG 


CTGATCAATT 


CCACAGTTTT 


4980 


15 


CGCGATCCAG 


ACTGAATGCC 


CACAGGCCGT 


\CGAGTTTTTT 


GATTTCACGG 


GTTGGGGTTT 


5040 




CTACAGGACG 


GACCATGNNC 


CCGGGGATCC 


TCTAGANNTT 


ATAGAGAGAG 


AGATAGATTT 


5100 


20 


ATAGAGAGAG 


ACTGGTGATT 


TCAGCGTGTC 


CTCTCCAAAT 


GAAATGAACT 


TCCTTATATA 


5160 




GAGGAAGGGT 


CTTGCGAAGG 


ATAGTGGGAT 


TGTSCGTCAT 


CCCTTACGTC 


AGTGGAGATG ■ 


5220 




TCACATCAAT 


CCACTTGCTT 


TGAAGACGTG 




CTTCTTTTTC 


CACGATGCTC 


5280 




CTCGTGGGTG 


GGGGTCCATC 


TTTGGGACCA 


CTGTCGGCAG 


AGGCATCTTG 


AATGATAGCC 


5340 




TTTCCTTTAT 


CGCAATGATG 


GCATTTGTAG 


GAGCCACCTT 


CCTTTTCTAC 


TGTCCTTTCG 


5400 


':3o 


ATGAAGTGAC 


AGATAGCTGG 


GCAATGGAAT 


CCGAGGAGGT 


TTCCCGAAAT 


TATCCTTTGT 


5460 


TGAAAAGTCT 


CAATANNNNG 


TCGACCTGCA 


GGCATGCAAQl 


CTAATTCCGG 


GGAAGCTTAG 


5520 




ATCCATGGAG 


CCATTTACAA 


TTGAATATAT 


CCTGCCGCCG 






5560 



